Figure 1

CGAGAGACGACAGAAGGTTACGGCTGCGAGAAGACGACAGAAGGGTCCAGAAAAA 

GGAAAGTGCTGGAGGGGAGTGGGGACAAAAGCAGCGACCAAGTGAATGTCACTTC 

AGTGACTGAGGCCAGGCAAAACGCGCGGGAAGGATTTTGTGTAGCTTGGGACCCTT 

TCATAGACACTGATGACACGTTTACGCAAAATAGAAATTTGAGGAGAAACGCCTGG 

GCCTTCGGAAAGGAGTGATTGATTAGTACTTGCAAGTTTAGGTGACTTTAAGGAGAA 

CTAACTAATGTATACTATTGAGGGAGGAGGAAGAGCATTACAGAGTTTCCAGCAGC 

AGCAGGAAAGCTTTGGTTAATTTGGAAATGGATGATAGCATTAAAATAACAGAAGC 

GCCTCCAGGTCTCTGAAGCTTCAGTCCCCCAGCTGAAAGCCAGAAAAGACTAAGCC 

CACTAAGCCTTTTGATCCCTTTGGAAGCAAAGAACTTTCCTTCCCTGGGGTGAAGAC 

TCTCCTCAGAAGATTTCCTGTCTCTGCCTATGTTACAAGAGGAATCAAAACCAAGAC 

AGAAGAGCTCAGGATGCAGGTGAGAGGCAGGGAAGTCAGCGGCTTGTTGATCTCCC 

TCACTGCTGTCTGCCTGGTGGTCACCCCTGGGAGCAGGGCCTGTCCTCGCCGCTGTG 

CCTGCTATGTGCCCACAGAGGTGCACTGTACATTTCGGTACCTGACCTCCATCCCAG 

ATGGCATCCCGGCCAATGTGGAACGAATAAATTTAGGATATAACAGCCTTACTAGAT 

TGACAGAAAACGACTTTGATGGCCTGAGCAAACTGGAGTTACTCATGCTGCACAGT 

AATGGCATTCACAGAGTCAGTGACAAGACCTTCTCGGGCTTGCAGTCCTTGCAGGTC 

TTAAAAATGAGCTATAACAAAGTCCAAATCATTCGGAAGGATACTTTCTACGGACTC 

GGGAGCTTGGTCCGGTTGCACCTGGATCACAACAACATTGAATTCATCAACCCTGAG 

GCCTTTTATGGACTTACCTCGCTCCGCTTGGTACATTTAGAAGGAAACCGGCTCACA 

AAGCTCCATCCAGACACATTTGTCTCATTAAGCTATCTCCAGATATTTAAAACCTCTT 

TCATTAAGTACCTGTTCTTGTCTGATAACTTCCTGACCTCCCTCCCAAAAGAAATGGT 

CTCCTACATGCCAAACCTAGAAAGCCTGTATTTGCATGGAAACCCATGGACCTGTGA 

CTGCCATTTAAAGTGGTTGTCTGAGTGGATGCAGGGAAACCCAGATATAATAAAAT 

GCAAGAAAGACAGAAGCTCTTCCAGTCCTCAGCAATGTCCCCTTTGCATGAACCCCA 

GGATCTCTAAAGGCAGACCCTTTGCTATGGTACCATCTGGAGCTTTCCTATGTACAA 

AGCCAACCATTGATCCATCACTGAAGTCAAAGAGCCTGGTTACTCAGGAGGACAAT 

GGATCTGCCTCCACCTCACCTCAAGATTTCATAGAACCCTTTGGCTCCTTGTCTTTGA 

ACATGACANANNTNTCTGGAAATAAGGCCGACATGGTCTGTAGTATCCAAAAGCCA 

TCAAGGACATCACCAACTGCATTCACTGAAGAAAATGACTACATCATGCTAAATGC 

GTCATTTTCCACAAATCTTGTGTGCAGTGTAGATTATAATCACATCCAGCCAGTGTG 

GCAACTTCTGGCTTTATACAGTGACTCTCCTCTGATACTAGAAAGGAAGCCCCAGCT 

TACCGAGACTCCTTCACTGTCTTCTAGATATAAACAGGTGGCTCTTAGGCCTGAAGA 

CATTTTTACCAGCATAGAGGCTGATGTCAGAGCAGACCCTTTTTGGTTCCAACAAGA 

AAAAATTGTCTTGCAGCTGAACAGAACTGCCACCACACTTAGCACATTACAGATCCA 

GTTTTCCACTGATGCTCAAATCGCTTTACCAAGGGCGGAGATGAGAGCGGAGAGAC 

TCAAATGGACCATGATCCTGATGATGAACAATCCCAAACTGGAACGCACTGTCCTGG 

TTGGCGGCACTATTGCCCTGAGCTGTCCAGGCAAAGGCGACCCTTCACCTCACTTGG 

AATGGCTTCTAGCTGATGGGAGTAAAGTGAGAGCCCCTTACGTTAGCGAGGATGGG 
CGAATCCTAATAGACAAAAATGGGAAGTTGGAACTGCAGATGGCTGACAGCTTTGA

TGCAGGTCTTTACCACTGCATAAGCACCAATGATGCAGATGCGGATGTTCTCACATA 

CAGGATAACTGTGGTAGAGCCCTATGGAGAAAGCACACATGACAGTGGAGTCCAGC 

ACACAGTGGTTACGGGTGAGACGCTCGACCTTCCATGCCTTTCCACGGGTGTTCCAG 

ATGCTTCTATTAGCTGGATTCTTCCAGGGAACACTGTGTTCTCTCAGCCATCAAGAG 

ACAGGCAAATTCTTAACAATGGGACCTTAAGAATATTACAGGTTACGCCAAAAGAT 

CAAGGTCATTACCAATGTGTGGCTGCCAACCCATCAGGGGCCGACTTTTCCAGTTTT 

AAAGTTTCAGTTCAAAAGAAAGGCCAAAGGATGGTTGAGCATGACAGGGAGGCAG 

GTGGATCTGGACTTGGAGAACCCAACTCCAGTGTTTCCCTTAAGCAGCCAGCATCTT 

TGAAACTCTCTGCATCAGCTTTGACAGGGTCAGAGGCTGGAAAACAAGTCTCCGGTG 

TACATAGGAAGAACAAACATAGAGACTTAATACATCGGCGGCGTGGGGATTCCACG 

CTCCGGCGATTCAGGGAGCATAGGAGGCAGCTCCCTCTCTCTGCTCGGAGAATTGAC 

CCGCAACGCTGGGCAGCACTTCTAGAAAAAGCCAAAAAGAATTCTGTGCCAAAAAA 

GCAAGAAAATACCACAGTAAAGCCAGTGCCACTGGCTGTTCCCCTCGTGGAACTCA 

CTGACGAGGAAAAGGATGCCTCTGGCATGATTCCTCCAGATGAAGAATTCATGGTTC 

TGAAAACTAAGGCTTCTGGTGTCCCAGGAAGGTCACCAACTGCTGACTCTGGACCAG 

TAAATCATGGTTTTATGACGAGTATAGCTTCTGGCACAGAAGTCTCAACTGTGAATC 

CACAAACACTACAATCTGAGCACCTTCCTGATTTCAAATTATTTAGTGTAACAAACG 

GTACAGCTGTGACAAAGAGTATGAACCCATCCATAGCAAGCAAAATAGAAGATACA 

ACCAACCAAAACCCAATCATTATCTTTCCATCAGTAGCTGAAATTCGAGATTCTGCT 

CAGGCAGGAAGAGCATCTTCCCAAAGTGCACACCCTGTAACAGGGGGAAACATGGC 

TACCTATGGCCATACCAACACATATAGTAGCTTTACCAGCAAAGCCAGTACAGTCTT 

GCAGCCAATAAATCCAACAGAAAGTTATGGACCTCAGATACCTATTACAGGAGTCA 

GCAGACCTAGCAGTAGTGACATCTCTTCTCACACTACTGCAGACCCTAGCTTCTCCA 

GTCACCCTTCAGGTTCACACACCACTGCCTCGTCTTTATTTCACATTCCTAGAAACAA 

CAATACAGGTAACTTCCCCTTGTCCAGGCACTTGGGAAGAGAGAGGACAATTTGGA 

GCAGAGGGAGAGTTAAAAACCCACATAGAACCCCAGTTCTCCGACGGCATAGACAC 

AGGACTGTGAGGCCAGCAATCAAGGGACCTGCTAACAAAAATGTGAGCCAAGTTCC 

AGCCACAGAGTACCCTGGGATGTGCCACACATGTCCTTCCGCAGAGGGGCTCACAG 

TGGCTACTGCAGCACTGTCAGTTCCAAGTTCATCCCACAGTGCCCTCCCCAAAACTA 

ATAATGTTGGGGTCATAGCAGAAGAGTCTACCACTGTGGTCAAGAAACCACTGTTAC 

TATTTAAGGACAAACAAAATGTAGATATTGAGATAATAACAACCACTACAAAATAT 

TCCGGAGGGGAAAGTAACCACGTGATTCCTACGGAAGCAAGCATGACTTCTGCTCC 

AACATCTGTATCCCTGGGGAAATCTCCTGTAGACAATAGTGGTCACCTGAGCATGCC 

TGGGACCATCCAAACTGGGAAAGATTCAGTGGAAACAACACCACTTCCCAGCCCCC 

TCAGCACACCCTCAATACCAACAAGCACAAAATTCTCAAAGAGGAAAACTCCCTTG 

CACCAGATCTTTGTAAATAACCAGAAGAAGGAGGGGATGTTAAAGAATCCATATCA 

ATTCGGTTTACAAAAGAACCCAGCCGCAAAGCTTCCCAAAATAGCTCCTCTTTTACC 

CACAGGTCAGAGTTCCCCCTCAGATTCTACAACTCTCTTGACAAGTCCGCCACCAGC 

TCTGTCTACAACAATGGCTGCCACTCAGAACAAGGGCACTGAAGTAGTATCAGGTG 
CCAGAAGTCTCTCAGCAGGGAAGAAGCAGCCCTTCACCAACTCCTCTCCAGTGCTTC

CTAGCACCATAAGCAAGAGATCTAATACATTAAACTTCTTGTCAACGGAAACCCCCA 

CAGTGACAAGTCCTACTGCTACTGCATCTGTCATTATGTCTGAAACCCAACGAACAA 

GATCCAAAGAAGCAAAAGACCAAATAAAGGGGCCTCGGAAGAACAGAAACAACGC 

AAACACCACCCCCAGGCAGGTTTCTGGCTATAGTGCATACTCAGCTCTAACAACAGC 

TGATACCCCCTTGGCTTTCAGTCATTCCCCACGACAAGATGATGGTGGAAATGTAAG 

TGCAGTTGCTTATCACTCAACAACCTCTCTTCTGGCCATAACTGAACTGTTTGAGAA 

GTACACCCAGACTTTGGGAAATACAACAGCTTTGGAAACAACGTTGTTGAGCAAAT 

CACAGGAGAGTACCACAGTGAAAAGAGCCTCAGACACACCACCACCACTCCTCAGC 

AGTGGGGCGCCCCCAGTGCCCACTCCTTCCCCACCTCCTTTTACTAAGGGTGTGGTT 

ACAGACAGCAAAGTCACATCAGCTTTCCAGATGACGTCAAATAGAGTGGTCACCAT 

ATATGAATCTTCAAGGCACAATACAGATCTGCAGCAACCCTCAGCAGAGGCTAGCC 

CCAATCCTGAGATCATAACTGGAACCACTGACTCTCCCTCTAATCTGTTTCCATCCAC 

TTCTGTGCCAGCACTAAGGGTAGATAAACCACAGAATTCTAAATGGAAGCCCTCTCC 

CTGGCCAGAACACAAATATCAGCTCAAGTCATACTCCGAAACCATTGAGAAGGGCA 

AAAGGCCAGCAGTAAGCATGTCCCCCCACCTCAGCCTTCCAGAGGCCAGCACTCAT 

GCCTCACACTGGAATACACAGAAGCATGCAGAAAAGAGTGTTTTTGATAAGAAACC 

TGGTCAAAACCCAACTTCCAAACATCTGCCTTACGTCTCTCTACCTAAGACTCTATTG 

AAAAAGCCAAGAATAATTGGAGGAAAGGCTGCAAGCTTTACAGTTCCAGCTAATTC 

AGACGTTTTTCTTCCTTGTGAGGCTGTTGGAGACCCACTGCCCATCATCCACTGGACC 

AGAGTTTCATCAGGANTTGAAATATCCCAAGGGACACAGAAAAGCCGGTTCCACGT 

GCTTCCCAATGGCACCTTGTCCATCCAGAGGGTCAGTATTCAGGACCGTGGACAGTA 

CCTGTGCTCTGCATTTAATCCACTGGGCGTAGACCATTTTCATGTCTCTTTGTCTGTG 

GTTTTTTACCCGGCAAGGATTTTGGACAGACATGTCAAGGAGATCACAGTTCACTTT 

GGAAGTACTGTGGAACTAAAGTGCAGAGTGGAGGGTATGCCGAGGCCTACGGTTTC 

CTGGATACTTGCAAACCAAACGGTGGTCTCAGAAACGGCCAAGGGAAGCAGAAAGG 

TCTGGGTAACACCTGATGGAACATTGATCATCTATAATCTGAGTCTTTATGATCGTG 

GTTTTTACAAGTGTGTGGCCAGCAACCCATCTGGCCAGGATTCACTGTTGGTTAAGA 

TACAAGTCATCACAGCTCCCCCTGTCATTATAGAGCAAAAGAGGCAAGCCATCGTTG 

GGGTTTTAGGTGGAAGTTTGAAACTGCCCTGCACTGCAAAAGGAACTCCCCAGCCTA 

GTGTTCACTGGGTCCTTTATGATGGGACTGAACTAAAACCATTGCAGTTGACTCATT 

CCAGATTTTTCTTGTATCCAAATGGAACTCTGTATATAAGAAGCATCGCTCCTTCAGT 

GAGGGGCACTTATGAGTGCATTGCCACCAGCTCCTCAGGCTCAGAGAGAAGGGTAG 

TGATTCTTACTGTGGAAGAGGGAGAGACAATCCCCAGGATAGAAACTGCCTCTCAG 

AAATGGACTGAGGTGAATTTGGGTGAGAAATTACTACTGAACTGCTCAGCTACTGG 

GGATCCAAAGCCTAGAATAATCTGGAGGCTGCCATCCAAGGCTGTCATCGACCAGT 

GGCACAGAATGGGCAGCCGAATCCACGTCTACCCAAATGGATCCTTGGTGGTTGGG 

TCAGTGACGGAAAAAGACGCTGGTGACTACTTATGTGTGGCAAGAAACAAAATGGG 

AGATGACCTAGTCCTGATGCATGTCCGCCTGAGATTGACACCTGCCAAAATTGAACA 

GAAGCAGTATTTTAAGAAGCAAGTGCTCCATGGGAAAGATTTCCAAGTTGACTGCA 
AGGCCTCTGGCTCCCCTGTGCCTGAGGTATCCTGGAGTTTGCCTGATGGGACAGTGC

TCAACAATGTAGCCCAAGCTGATGACAGTGGCTATAGGACCAAGAGGTACACCCTT 

TTCCACAATGGAACCTTGTATTTCAACAACGTTGGGATGGCAGAGGAAGGAGATTAT 

ATCTGCTCTGCCCAGAACACCTTAGGGAAAGATGAGATGAAAGTCCACCTAACAGT 

TCTAACAGCCATCCCACGGATAAGGCAAAGCTACAAGACCACCATGAGGCTCAGGG 

CTGGAGAAACAGCTGTCCTTGACTGCGAGGTCACTGGGGAACCGAAGCCCAATGTA 

TTTTGGTTGCTGCCTTCCAACAATGTCATTTCATTCTCCAATGACAGGTTCACATTTC 

ATGCCAATAGAACTTTGTCCATCCATAAAGTGAAACCACTTGACTCTGGGGACTATG 

TGTGCGTAGCTCAGAATCCTAGTGGGGATGACACTAAGACATACAAACTGGACATT 

GTCTCTAAACCTCCATTAATCAATGGCCTGTATGCAAACAAGACTGTTATTAAAGCC 

ACAGCCATTCGGCACTCCAAAAAATACTTTGACTGCAGAGCAGATGGGATCCCATCT 

TCCCAGGTCACGTGGATTATGCCAGGCAATATTTTCCTCCCAGCTCCATACTTTGGA 

AGCAGAGTCACGGTCCATCCAAATGGAACCTTGGAGATGAGGAACATCCGGCTTTC 

TGACTCTGCGGACTTCACCTGTGTGGTTCGGAGCGAGGGAGGAGAGAGTGTGTTGGT 

AGTGCAGTTAGAAGTCCTAGAAATGCTGAGAAGACCAACATTCAGAAACCCATTCA 

ACGAAAAAGTCATCGCCCAAGCTGGCAAGCCCGTAGCACTGAACTGCTCTGTGGAT 

GGGAACCCCCCACCTGAAATTACCTGGATCTTACCTGACGGCACACAGTTTGCTAAC 

AGACCACACAATTCCCCGTATCTGATGGCAGGCAATGGCTCTCTCATCCTTTACAAA 

GCAACTCGGAACAAGTCAGGGAAGTATCGCTGTGCAGCCAGGAATAAGGTTGGCTA 

CATCGAGAAACTCATCCTGTTAGAGATTGGGCAGAAGCCAGTCATTCTGACATACGA 

ACCAGGGATGGTGAAGAGCGTCAGTGGGGAACCGTTATCACTGCATTGTGTGTCTG 

ATGGGATCCCCAAGCCAAATGTCAAGTGGACTACACCGGGTGGCCATGTAATCGAC 

AGGCCTCAAGTGGATGGAAAATACATACTGCATGAAAATGGCACGCTGGTCATCAA 

AGCAACAACAGCTCACGACCAAGGAAATTATATCTGTAGGGCTCAAAACAGTGTTG 

GCCAGGCAGTTATTAGCGTGTCAGTGATGGTTGTGGCCTACCCTCCCCGAATCATAA 

ACTACCTACCCAGGAACATGCTCAGGAGGACAGGGGAAGCCATGCAGCTCCACTGT 

GTGGCCTTGGGAATCCCCAAGCCAAAAGTCACCTGGGAGACGCCAAGACACTCCCT 

GCTCTCAAAAGCAACAGCAAGAAAACCCCATAGAAGTGAGATGCTTCACCCACAAG 

GTACGCTGGTCATTCAGAATCTCCAAACCTCGGATTCCGGAGTCTATAAGTGCAGAG 

CTCAGAACCTACTTGGGACTGATTACGCAACAACTTACATCCAGGTACTCTGACAGG 

AAGGGGGAGACTAAAATTCAACAGAAGTCCACATCCACAGGGTTTATTTTTTGGAA 

GAAGTTTAATCAAAGGCAGCCATAGGCATGTAAATGAGTCTGAATACATTTACAGT 

ATTAAATTTACAATGGACATGCGATGAGACTTGTAAATGAAAGCATTGTGAACTGA 

AACCGAGTCTCTGTGGATCTCAAAGCAAACTCTTAACTTAAGGCACTTTGATTTTGC 

CAACAAATAATAACAAACATTAAGAGAAAAAAATGATCCACTACGAAATAACAAAC 

GGCTAATGCACCTGAATTCTCAGTAAAAAGACCTTTCTCTCGCTAACAGTTGCCAGC 

TGCCTCGTGTCTGTTTCCTACCAATGTCACAAACATCGCACACAGGGTGAATGGAGT 

CAACGGGAAAGATTAAGTTTGCGGTCTGTGTAAATCTCAATGTACAAATATTCTGTC 

NCTGGTTTATAAACATTTTGATAAAACCGAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAA (SEQ ID NO: 1) 
Figure 2




EcoRI (5579) 
Figure 3

MQVRGREVSGLLISLTAVCLVVTPGSRACPRRCACYVPTEVHCTFRYLTSIPDGIPANVE

RINLGYNSLTRLTENDFDGLSKLELLMLHSNGIHRVSDKTFSGLQSLQVLKMSYNKVQII

RKDTFYGLGSLVRLHLDHNNIEFINPEAFYGLTSLRLVHLEGNRLTKLHPDTFVSLSYLQI

FKTSFIKYLFLSDNFLTSLPKEMVSYMPNLESLYLHGNPWTCDCHLKWLSEWMQGNPDI

IKCKKDRSSSSPQQCPLCMNPRISKGRPFAMVPSGAFLCTKPTIDPSLKSKSLVTQEDNGS

ASTSPQDFIEPFGSLSLNMTXXSGNKADMVCSIQKPSRTSPTAFTEENDYIMLNASFSTNL

VCSVDYNHIQPVWQLLALYSDSPLILERKPQLTETPSLSSRYKQVALRPEDIFTSIEADVR

ADPFWFQQEKIVLQLNRTATTLSTLQIQFSTDAQIALPRAEMRAERLKWTMILMMNNPK

LERTVLVGGTIALSCPGKGDPSPHLEWLLADGSKVRAPYVSEDGRILIDKNGKLELQMA 

DSFDAGLYHCISTNDADADVLTYRITVVEPYGESTHDSGVQHTVVTGETLDLPCLSTGV

PDASISWILPGNTVFSQPSRDRQILNNGTLRILQVTPKDQGHYQCVAANPSGADFSSFKV 

SVQKKGQRMVEHDREAGGSGLGEPNSSVSLKQPASLKLSASALTGSEAGKQVSGVHRK 

NKHRDLIHRRRGDSTLRRFREHRRQLPLSARRIDPQRWAALLEKAKKNSVPKKQENTTV

KPVPLAVPLVELTDEEKDASGMIPPDEEFMVLKTKASGVPGRSPTADSGPVNHGFMTSI 

ASGTEVSTVNPQTLQSEHLPDFKLFSVTNGTAVTKSMNPSIASKIEDTTNQNPIIIFPSVAE 

IRDSAQAGRASSQSAHPVTGGNMATYGHTNTYSSFTSKASTVLQPINPTESYGPQIPITGV 

SRPSSSDISSHTTADPSFSSHPSGSHTTASSLFHIPRNNNTGNFPLSRHLGRERTIWSRGRV

KNPHRTPVLRRHRHRTVRPAIKGPANKNVSQVPATEYPGMCHTCPSAEGLTVATAALS

VPSSSHSALPKTNNVGVIAEESTTVVKKPLLLFKDKQNVDIEIITTTTKYSGGESNHVIPTE

ASMTSAPTSVSLGKSPVDNSGHLSMPGTIQTGKDSVETTPLPSPLSTPSIPTSTKFSKRKTP

LHQIFVNNQKKEGMLKNPYQFGLQKNPAAKLPKIAPLLPTGQSSPSDSTTLLTSPPPALST

TMAATQNKGTEVVSGARSLSAGKKQPFTNSSPVLPSTISKRSNTLNFLSTETPTVTSPTAT

ASVIMSETQRTRSKEAKDQIKGPRKNRNNANTTPRQVSGYSAYSALTTADTPLAFSHSP

RQDDGGNVSAVAYHSTTSLLAITELFEKYTQTLGNTTALETTLLSKSQESTTVKRASDTP 

PPLLSSGAPPVPTPSPPPFTKGVVTDSKVTSAFQMTSNRVVTIYESSRHNTDLQQPSAEAS

PNPEIITGTTDSPSNLFPSTSVPALRVDKPQNSKWKPSPWPEHKYQLKSYSETIEKGKRPA

VSMSPHLSLPEASTHASHWNTQKHAEKSVFDKKPGQNPTSKHLPYVSLPKTLLKKPRIIG

GKAASFTVPANSDVFLPCEAVGDPLPIIHWTRVSSGXEISQGTQKSRFHVLPNGTLSIQRV

SIQDRGQYLCSAFNPLGVDHFHVSLSVVFYPARILDRHVKEITVHFGSTVELKCRVEGMP 

RPTVSWILANQTVVSETAKGSRKVWVTPDGTLIIYNLSLYDRGFYKCVASNPSGQDSLL

VKIQVITAPPVIIEQKRQAIVGVLGGSLKLPCTAKGTPQPSVHWVLYDGTELKPLQLTHS

RFFLYPNGTLYIRSIAPSVRGTYECIATSSSGSERRVVILTVEEGETIPRIETASQKWTEVN 

LGEKLLLNCSATGDPKPRIIWRLPSKAVIDQWHRMGSRIHVYPNGSLVVGSVTEKDAGD

YLCVARNKMGDDLVLMHVRLRLTPAKIEQKQYFKKQVLHGKDFQVDCKASGSPVPEV

SWSLPDGTVLNNVAQADDSGYRTKRYTLFHNGTLYFNNVGMAEEGDYICSAQNTLGK

DEMKVHLTVLTAIPRIRQSYKTTMRLRAGETAVLDCEVTGEPKPNVFWLLPSNNVISFS 

NDRFTFHANRTLSIHKVKPLDSGDYVCVAQNPSGDDTKTYKLDIVSKPPLINGLYANKT

VIKATAIRHSKKYFDCRADGIPSSQVTWIMPGNIFLPAPYFGSRVTVHPNGTLEMRNIRLS
DSADFTCVVRSEGGESVLVVQLEVLEMLRRPTFRNPFNEKVIAQAGKPVALNCSVDGNP

PPEITWILPDGTQFANRPHNSPYLMAGNGSLILYKATRNKSGKYRCAARNKVGYIEKLIL 

LEIGQKPVILTYEPGMVKSVSGEPLSLHCVSDGIPKPNVKWTTPGGHVIDRPQVDGKYIL 

HENGTLVIKATTAHDQGNYICRAQNSVGQAVISVSVMVVAYPPRIINYLPRNMLRRTGE

AMQLHCVALGIPKPKVTWETPRHSLLSKATARKPHRSEMLHPQGTLVIQNLQTSDSGVY 

KCRAQNLLGTDYATTYIQVL (SEQ ID NO: 2) 
Figure 5



[Schematic map. Legible labels: Rat-608 5'UTR; Rat-608 ORF (7791 bp); homology region; B*c23-261L4 (nucl-5'end); ATO(jfflcS73); positions 450 and 423.]



Figure 6 
Figure 7

Nuc 1



I^TGGAACCAACCCAGATGCCm^ 

TCCAATGGAATACTACICAACTTATTAAAAACAACC^ 

GNATGGTCTGNAGGATCITGAaTGAGGTAACC^ 

ACTCACroATAAQTGGCTATTTGTCTATGmGTCUm 
TTnTGTGTGTATAATATTAAGATGGAAATTTCKX^GTGCro 
TGTTTCAGTGGATTACCGGAAGAAAATAATAAGTGAACTGTAG 

Nuc 390

AGGTGACAAAQTATCXTIXlACACATTGGGAAAAGACCAr: AGTCCAGGA A ACTGAOTCTTA A 

GQATTCAIATrMCTCCAQTC 

AACTTCACrACXJrTCCT^^^ 

pCATTAAGCTGCACTOAAAC^ 

ffiAAAOTAK^ccAAAG^ 

GATOGCXrTTI^GTTTTGAAA^^ 
ATTTAAgCCra CA^ 

TCl^AAAMGACTTrOGTATGCATAGGATCTAAATGTAA ATGTCATTTACTOAnAT'A^T 

AAATAGqA<^Cm<X7rAOAAT^^ 

A ? A ^ff ATATA A^rcTaQOAAA<ra^^ 

G ATGCCATATTTCCAAGfir, A AfiTGTtX^AI^TiryY^ACATOTT A AO A AlvmiCT^CTCTPT^ 
TTOACTCTTTAATTCTAni pTTtiATCATrAATGACT irro 

- ^™^ ta ™ctaat^ 
^ A ^?J AA ^ T A * A AA ^ A ^^^ 

AA *ff^ A *^ TT ™mAAOAC^ 

AA c AA ^rcASAACAMC^^^ 

fiAAAAA^AreA TATOTI^^ 

-^^aa^^a^ 

ffTA^CTO^A^ 

GGAAGAGTACATTCGAAA^AGACAC^ATTtrrOTAG^TOTAf^ 

AJaC^OfflCA AAA^^ 
C^CirnWT ATAqCATTOQA^ 

AAAAAAAAAAAAAAAAAAAAACK1AAGGTCAGACA 
GCAAACAGAAGGGGAGTCACAAACCCAGGAC^AACCACAGTG^ 

CTATGACATGTITlTITATrcAAGTATATCACATGCTGTtl^GCATA 
GCC(XA(XXXXXriTrQCCCCG<3^
CACACACACACACACACACACACAOTITr^^ 
CACCCAAQCTTAAlXXTTTTCmCOCTGC^ 
CCCCCTCCaXTGTTAGGAGATGGGAGAGGTCA^ 

GGAGGAGCAAACTCXK^GOCX3AGCCrGGGTTGGCGCT^ 

cccATrcccAGAAGCcoAGGCTATAGCTGCGTC^ 

ACaXTAOAGOCimQCAGACCCCAAAQGCCOTGCAATT^ 
OAGaWCTGCTMTCCCCrCaGQAAGTG^ 

Exon 1 (Nuc 3661)

GeTCXKK?TTCA(K30a}AGACAGAOGG^ 
ACTGGAGGCKSAGTTGGGACAAAAGCAGCGACCAAGK^ 
GGCAAAAGGA<XX^AAGGATTATATCTA<XX^ 
TlXrlGCAAAiGCAAGCAATITGAGG^ 

AGTACTrGTAAGTTTA^TGAGTlTGAGAACTAACrAACXn'ATACTATTGAGGGAGAAGGAAG 
AGCATTCCAGCaG<>GCAGKaGCA^^ 

ATGTATt5ATAC^TTAAAATAACAGAAC<X>CCTCCAGTXX7rC^ 
GTGAAGACTAAGCCTACTAAGCCTITTGCTC^^ 
TGAAGG<^rCTC<nX^GAAGATTIXXrityiC^ 
AGAAGAjGCK^ GOTATTGOCAACTCTTT 



Nuc 4203

AGl 



TGTTAAATACAGTTTGAGGCTTAAGTOTACGGGAAC^ 
CTIXTTCTT ATAACTAACTCTTAAOGrGC^^ 
GATCHTTGTTTA TCTAATAATAGCAA<X?TCATC T^ 

CAAAAATATTCAGTGATGTAACAGTOACAGTGTAGGCATAGAAGTAAT^^ 

TTAATirGGGTTAAACTCATTCATAACAGCTCCAGGTrqGGAGGGATCACTC 

CACGTGCXSGGTTAAAGATATTTTCTAACAAGAGAAGCAGAATT^ 

CCATCACTGTGTCAGTAAGCAGAGGGGTGTTTCCAAGCAGAGAAAGAGCAGACAGTGTTA 

T<KXnxX^AAmCAGAGACTCAGCCC^ 

AGTTT^CTCrGAAAAGG<XCATGTGTCCTTATTGGCAA 

GAAATTTGAOCITrTTTTCrAGTGGGTTATTACAGCTGTA A A AGTATnTGGAAGGTTA AG 

CCAAATAAATAAAACACATATTAAATAATACAATGTTACAAAAATTGATCATATAAAQAA 

QTACATTCATAAATGCAATGTCAAAAATATATATAATT 

TTTTCTAAATTGCACAIXjTACTATTTTTATATTTATAAAAATATTT^ 

AGTGTAAAAGOCTCTTGGTCAAACAAQAQAGTTA^ 

AACATTATTATGATCTCTAATGACAGGGATCCTGCTTTTrATTG 

AAQATATGTTTAC^TAATAAGCCCATTTAGTGATAAAGTCCAATO 

ACTGGTrTATAAAGAGAACAGTTTCCTGAGTCTATGCAAGTra 

AOTTCCTCTTTCKXXGATTTCACrAG^ 

CATTATAAOT<KiAATAAaATAGGATCTCAAAGGAATO 

CTT^GQTCTnxnTTC^GTTCACTA^ 

TAATTTirTTCXAOTTCrTATq^ 

ACCTCTGAAACTAGAACTACCAATCCATTACrrGOOT 

GAATTTATKX^ACATTGTTC^AT^ 

OCOGTCAAAATATGAA(XK}ATAAAGAAAATGT<SGAAATCTO 

GCCATAAAAGAGTGAAATCATGACATATGGAGGAAATGGATGCAACTO^AAATCAATTG 

GGCTAATCAAAACAAGACAGACTCAAAAAGGAAACA(^TOTAGCr^ 

GAAGCTAGATTTA^CTTGTACGTG^^TGTGTCTTTA 

ATTCTAATCTQTGAGTGTGTATAAAGGC ATGCATGTAAAGCA A A AAC AAGCTAGGTOGGO 
TOQGTAGGAGAGAAAGCAATGAGAGGAGTTAATAAGAACGAAGrATAGTAACATAGGTG 
CCA<KSATGAAAT(XATTAATTTCiTAT<XflAA 

AACC AGGGTGAAATCCCAGCAC AGAGA AOGGGAAGT AGACA OA A AGTTTCOCCACTAAC 

CXAGAAQCCATTrGCAGTTGCTOCCTGCTGGGAGGGGra 

ACroTGTATAA^CGAGTTGACAA^^ 

TTTCATI'i'l'l'lT lGITACTQTTTTOTrQTT QTQQTOGTTOTTGTOG^ 
ATTTGTTTCTTTTGAGAGAQAGAAGGAACATGAAArKSGGTG 
ATCTGGAAGAAQTIXXXKjAAAGAGAAAAATTQTATGGAGCATATTT^ 
AACAAACAA AAGGTTCATTTTGCXIAC^AAAAGQTGTGAATTAA ATTA ACCAflTTAfXrA^ 
CTrAAAGAAAATATTCCCAATTATTa^AGAGTTGCTATGTA 
TExon2(Nw:*559)

CTTOAACItKiCCCTATAACTCnXXfltn^ 
G^C^TGClTOCXGAacrCCC^ 
TG1XXIWC0GATOTGCCIXKTATCTCC(X^ 
TCCATCCCAGA«WGCAT<XXA«^ 

Nuc 6755

CKl\CClTtKXnX3AJCTa^ 
AAACACCTGTC^TTACTTrTAATAGTTTrcrTCT 
TGGAGACAATGAACATTTTATCCX^TAGTCTTTTACTAQAAOT 
GTTTCKKJACKCTCATAATTATGGGGCAGCmATTC^^ 

a<^qttrclxjttaacaatcattatoataccaaggaaga<kj^ 

aaaagcatttcttttgcaatttataaataccxattac^ 

ccttactaaatcnrgacaaattatggtga^ 

ataaatgcacmqaaatggoatatgccatcacctatgtgc<^^ 

ato<xaaacactattgttcacgtttggaw 

tataaccotataccca0tgcc11x3qaa<^gattx3caaacaqtaaaqact^ 
ctccctatga(xk3ao<x3gccttg<xktg^ 

<kxjaggtntatoataacttoacxx!aagctagagtca^ 

GAGAACATGarfGAAGAAGATCAGATTATAGGCAOGCCTCT 
TGATTCATGGGGCAGGGCCCAGTCX^TTGTTCGTGGTAC^ 

AAAAAAAAACA<3GCTXM,<K!AAaTGlCAAGOAGCAAGlX^GTCJAO 
TCTCrGC^TCAGCTCCTGCCTC^ 

CAGTGATGAACAATGATGTGGAAGTATAAG<XAAATAAATO7m 

TTGGTC^TGATGTITCATCACAGTGATAATAGTCCTC^ 

CCTTI^ACTAAATTCTGTTATCTATAGCTGAGGAAAATGG^ 

CTACACCAGAGTOTAATCTGGXXXTOAGCTTAG^ 

GGGGCTTCTTAACCGGAACACAGTTGTAAAAGOGAATTTT^ 

Exon 3 (Nuc 8689)

TGTGOACTTTAATTGACXiArrCATCrGAAGCTGAAAATO^ 

C^^OATTOAQtGAAA4TCACmTCTGGCCTGAGCACtA^ 
GCAATGGC&TTCACAC&GTCAGTGAC^ 

GTGAGATAOOTAGAG<K3TGATGGA<KKn<3A<MAGAGAGG^ 

AAAAGCTGCTGATTCCCGTGGGAGACATTCTATAAOCATICTA^ 

TCAACXMAaMTTTCAATTGTAATCCAAT^ 

CTXXXX3AAAATGCTTACACAATT0K^ 

AAAG<^CAGCATGGCGGCAGACACCTGCAATTCCTGTCCC^ 

CCCAGAGACTX^TTGGCXJAAACACrcrATTCAATCAATG 

ACTTGACTCAGAAACTAATGTGGAAAGCATCAG<MAG^ 

CAT^TGAATAACKmTCCCAGAGAGMOGGAAGA 

MGGAAGGAAGGAAGO\AGGAAGGAA<K}AAGC^GAGAGG<3AGGA^ 

AAGaAAGGAAAGGGAAAGGAAAAAAGAGATGGGGAGGa^GGGAAGGAAAGGAMGaG 

GGAGAAAGAAGAGAAGAAAGGAAAATAAATAAATT^ 

nTTATC£ATAAAAGGTCATTItX^CCTGTTT^ 

AAGGGCAGTCXTTAACATAGTAGCATTITATAAA<XATrACAAATTTK^ 

TTrrTATCCTCrACCATtnTCAAACTGAAACTACA^ 

TGTAAGAGTTTreaCACACCGAAGTGGGAAAOTM 

AATGGGGAGAACAAAAAGAGACTOCSACAGGGAGCCCTTrciCTXKr^ 

TTOAGAACCTAATTGACCCTGCATGACTAC^^ 

ATGAAAAGCACAATAAAGTACATATGTCCCATAATATTCATCAAAAT^ 
A^TAG<^TCAAAGCAATAACACCCACTGTT^ 

ACTATGTCTAGTGTTTTGACITAGGGTACATAGTATGCTGTGTCTGTA 

ATTTAGGTCATCAGACAGCATTTGGAACATGTATCrrcAGGAGaAATCATTCATGTA 
GC^TGAAATTCTCCACCTATGTTTATTC^ 

GGGTTTGAGGrrrTACKXXAGGTAACATTTAGGGAAAAGCTGTCT 

CTT^ATTTATGAGGGATGTTGGTATTCCAGAAAATTC^^ 

GGTCAAAACAGAAAAATATCTAAAAAGTTATTGTTTTTATTAGTA^ 

TTTTAAAAATGOTATGCTTAGAACTMTTAAGATTAGATTAGATTAGATTAGAAAATAATC 

AGAGAGGGATITGATGAATGCTAAAGCATGATGAAAAATrCAAAATlTrTTGC^ 

GAGTGGATAAACTGTGTAACGTGAGTGCTATGTAGAAACAGAAAGGAGTGAAGGGTTOA ' 
TGTGCGCTGCAACATCTTGAAAACATO

ACATATTOCATGGlTATOTrTA^ 

GTAAGATGTTACTTAGGGTC^GGAA^ 

ATCTTAATTTGGGQAAAAOACAATTTCCT 

ATCCCTGIGGATATTGTAATAAACC^^ 

GCAGGAGGAAGTGAAATGAGATGGTAGAAAGGAAAGTCAT^^ 

GGIXKiAATCTAGATATGTTAATATATTXiACAT^ 

AAAACCAACAGGAGTGACXKjAGACAATAGG^^ 

TGTGTGTGGAGACACC&TAATAAA^^ 

TAAAAACAAAACATITITGCACAAGA^ 

GAAGTTGAAGTTCATTGATAGTCTCATAAATCTTAAATGTAT^ 

mATrATTAATTACTCTrGCTGTC^TTA 

ATGCTTTTCACCATACACAAAIXn'AGGCAGAAGAGTGTAATCCA 

GGAGA<K3GAAAGGAA.GCGGATGCGGGGCA0AGGCACACAG^^ 

ATQAACAAGAAAAAATGTGGACACATGCACAAAAATTCCATAGTCCACTACATTA 

TArTCTAATATTAAGAAAATAATAAACXXATTTCTGTGCACTT 

TTATCITGGCCACAGATCCTGTC^ 

AATCCAATCCAGGGCATGATGCTTACTXX^ 

AAGTAACCATGACATCTCTCTCHTCCm 

GTATGlTCrMmCrCT^ 

TGGAGTTAAGGIXXCATAGGATTAATCTC^ 

Exon4?(Nncll2S^)| 

TTGTGAAAGTTCCTTTGAGAAAACA^ 

AGCTATAACAAAGTO^AAT^ 

TOXACCfGGATCA^ 

GAGCTATCKX^M^TATTTAAAACCTCCTTC 

ACCTeC^XOUUAGAAATC^^ 

ACCC^TGGACCTGTGACTG^ 

Nuc 11680

ACTATCITGTTTGTTTGTITCTm 

TATCCCAAAAGTXXXXX^TAACCTCCCCC^ 

CCCTGGGATTCXXXJTGTACTGGGGCATAI^ 

AGTGATGGCCAACTAGGCCATCTTTTGAT^ 

TACTG<nTAGTTCATAATGTTGTTGCACCT 

N*clt967^ 

(SEQ ID NO: 3)
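cDNA_rat      1 cgagagacgacagaaggttacggctgcgagaagacgacagaagggtccag
genomic_hu    1 --------------------------------------------------

cDNA_rat     51 aaaaaggaaagtgctggaggggagtggggacaaaagcagcgaccaagtga
genomic_hu    1 --------------------------------------------------

cDNA_rat    101 atgtcacttcagtgactgaggccaggcaaaacgcgcgggaaggattttgt
genomic_hu    1 --------------------------------------------------

cDNA_rat    151 gtagcttgggaccctttcatagacactgatgacacgtttacgcaaaatag
genomic_hu    1 --------------------------------------------------

cDNA_rat    201 aaatttgaggagaaacgcctgggccttcggaaaggagtgattgattagta
genomic_hu    1 --------------------------------------------------

cDNA_rat    251 cttgcaagtttaggtgactttaaggagaactaactaatgtatactattga
genomic_hu    1 --------------------------------------------------

cDNA_rat    301 gggaggaggaagagcattacagagtttccagcagcagcaggaaagctttg
genomic_hu    1 --------------------------------------------------

cDNA_rat    351 gttaatttggaaatggatgatagcattaaaataacagaagcgcctccagg
genomic_hu    1 --------------------------------------------------

cDNA_rat    401 tctctgaagcttcagtcccccagctgaaagccagaaaagactaagcccac
genomic_hu    1 --------------------------------------------------

cDNA_rat    451 taagccttttgatccctttggaagcaaagaactttccttccctggggtga
genomic_hu    1 --------------------------------------------------

cDNA_rat    501 agactctcctcagaagatttcctgtctctgcctatgttacaagaggaatc
genomic_hu    1 --------------------------------------------------

cDNA_rat    551 aaaaccaagacagaagagctcaggatgcaggtgagaggcagggaagtcag
genomic_hu    1 --------------------------------------------------

cDNA_rat    601 cggcttgttgatctccctcactgctgtctgcctggtggtcacccctggga
genomic_hu    1 --------------------------------------------------

cDNA_rat    651 gcagggcctgtcctcgccgctgtgcctgctatgtgcccacagaggtgcac
genomic_hu    1 --------------------------------------------------

cDNA_rat    701 tgtacatttcggtacctgacctccatcccagatggcatcccggccaatgt
genomic_hu    1 --------------------------------------------------

cDNA_rat    751 ggaacgaataaatttaggatataacagccttactagattgacagaaaacg
genomic_hu    1 --------------------------------------------------

cDNA_rat    801 actttgatggcctgagcaaactggagttactcatgctgcacagtaatggc
genomic_hu    1 --------------------------------------------------

cDNA_rat    851 attcacagagtcagtgacaagaccttctcgggcttgcagtccttgcaggt
genomic_hu    1 --------------------------------------------------

cDNA_rat    901 cttaaaaatgagctataacaaagtccaaatcattcggaaggatactttct
genomic_hu    1 --------------------------------------------------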
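cDNA_rat    951 acggactcgggagcttggtccggttgcacctggatcacaacaacattgaa
genomic_hu    1 --------------------------------------------------

cDNA_rat   1001 ttcatcaaccctgaggccttttatggacttacctcgctccgcttggtaca
genomic_hu    1 --------------------------------------------------

cDNA_rat   1051 tttagaaggaaaccggctcacaaagctccatccagacacatttgtctcat
genomic_hu    1 --------------------------------------------------

cDNA_rat   1101 taagctatctccagatatttaaaacctctttcattaagtacctgttcttg
genomic_hu    1 --------------------------------------------------

cDNA_rat   1151 tctgataacttcctgacctccctcccaaaagaaatggtctcctacatgcc
genomic_hu    1 --------------------------------------------------

cDNA_rat   1201 aaacctagaaagcctgtatttgcatggaaacccatggacctgtgactgcc
genomic_hu    1 --------------------------------------------------

cDNA_rat   1251 atttaaagtggttgtctgagtggatgcagggaaacccagatataataaaa
genomic_hu    1 --------------------------------------------------

cDNA_rat   1301 tgcaagaaagacagaagctcttccagtcctcagcaatgtcccctttgcat
genomic_hu    1 --------------------------------------------------

cDNA_rat   1351 gaaccccaggatctctaaaggcagaccctttgctatggtaccatctggag
genomic_hu    1 --------------------------------------------------

cDNA_rat   1401 ctttcctatgtacaaagccaaccattgatccatcactgaagtcaaagagc
genomic_hu    1 --------------------------------------------------

cDNA_rat   1451 ctggttactcaggaggacaatggatctgcctccacctcacctcaagattt
genomic_hu    1 --------------------------------------------------

cDNA_rat   1501 catagaaccctttggctccttgtctttgaacatgacananntntctggaa
genomic_hu    1 --------------------------------------------------

cDNA_rat   1551 ataaggccgacatggtctgtagtatccaaaagccatcaaggacatcacca
genomic_hu    1 --------------------------------------------------

cDNA_rat   1601 actgcattcactgaagaaaatgactacatcatgctaaatgcgtcattttc
genomic_hu    1 --------------------------------------------------

cDNA_rat   1651 cacaaatcttgtgtgcagtgtagattataatcacatccagccagtgtggc
genomic_hu    1 --------------------------------------------------

cDNA_rat   1701 aacttctggctttatacagtgactctcctctgatactagaaaggaagccc
genomic_hu    1 --------------------------------------------------

cDNA_rat   1751 cagcttaccgagactccttcactgtcttctagatataaacaggtggctct
genomic_hu    1 --------------------------------------------------

cDNA_rat   1801 taggcctgaagacatttttaccagcatagaggctgatgtcagagcagacc
genomic_hu    1 --------------------------------------------------

cDNA_rat   1851 ctttttggttccaacaagaaaaaattgtcttgcagctgaacagaactgcc
genomic_hu    1 --------------------------------------------------

cDNA_rat   1901 accacacttagcacattacagatccagttttccactgatgctcaaatcgc
genomic_hu    1 --------------------------------------------------

cDNA_rat   1951 tttaccaagggcggagatgagagcggagagactcaaatggaccatgatcc
genomic_hu    1 --------------------------------------------------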
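cDNA_rat   2001 tgatgatgaacaatcccaaactggaacgcactgtcctggttggcggcact
genomic_hu    1 --------------------------------------------------

cDNA_rat   2051 attgccctgagctgtccaggcaaaggcgacccttcacctcacttggaatg
genomic_hu    1 --------------------------------------------------

cDNA_rat   2101 gcttctagctgatgggagtaaagtgagagccccttacgttagcgaggatg
genomic_hu    1 --------------------------------------------------

cDNA_rat   2151 ggcgaatcctaatagacaaaaatgggaagttggaactgcagatggctgac
genomic_hu    1 --------------------------------------------------

cDNA_rat   2201 agctttgatgcaggtctttaccactgcataagcaccaatgatgcagatgc
genomic_hu    1 --------------------------------------------------

cDNA_rat   2251 ggatgttctcacatacaggataactgtggtagagccctatggagaaagca
genomic_hu    1 --------------------------------------------------

cDNA_rat   2301 cacatgacagtggagtccagcacacagtggttacgggtgagacgctcgac
genomic_hu    1 --------------------------------------------------

cDNA_rat   2351 cttccatgcctttccacgggtgttccagatgcttctattagctggattct
genomic_hu    1 --------------------------------------------------

cDNA_rat   2401 tccagggaacactgtgttctctcagccatcaagagacaggcaaattctta
genomic_hu    1 --------------------------------------------------

cDNA_rat   2451 acaatgggaccttaagaatattacaggttacgccaaaagatcaaggtcat
genomic_hu    1 --------------------------------------------------

cDNA_rat   2501 taccaatgtgtggctgccaacccatcaggggccgacttttccagttttaa
genomic_hu    1 --------------------------------------------------

cDNA_rat   2551 agtttcagttcaaaagaaaggccaaaggatggttgagcatgacagggagg
genomic_hu    1 --------------------------------------------------

cDNA_rat   2601 caggtggatctggacttggagaacccaactccagtgtttcccttaagcag
genomic_hu    1 --------------------------------------------------

cDNA_rat   2651 ccagcatctttgaaactctctgcatcagctttgacagggtcagaggctgg
genomic_hu    1 --------------------------------------------------

cDNA_rat   2701 aaaacaagtctccggtgtacataggaagaacaaacatagagacttaatac
genomic_hu    1 --------------------------------------------------

cDNA_rat   2751 atcggcggcgtggggattccacgctccggcgattcagggagcataggagg
genomic_hu    1 --------------------------------------------------

cDNA_rat   2801 cagctccctctctctgctcggagaattgacccgcaacgctgggcagcact
genomic_hu    1 --------------------------------------------------

cDNA_rat   2851 tctagaaaaagccaaaaagaattctgtgccaaaaaagcaagaaaatacca
genomic_hu    1 --------------------------------------------------

cDNA_rat   2901 cagtaaagccagtgccactggctgttcccctcgtggaactcactgacgag
genomic_hu    1 --------------------------------------------------

cDNA_rat   2951 gaaaaggatgcctctggcatgattcctccagatgaagaattcatggttct
genomic_hu    1 --------------------------------------------------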
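cDNA_rat   3001 gaaaactaaggcttctggtgtcccaggaaggtcaccaactgctgactctg
genomic_hu    1 --------------------------------------------------

cDNA_rat   3051 gaccagtaaatcatggttttatgacgagtatagcttctggcacagaagtc
genomic_hu    1 --------------------------------------------------

cDNA_rat   3101 tcaactgtgaatccacaaacactacaatctgagcaccttcctgatttcaa
genomic_hu    1 --------------------------------------------------

cDNA_rat   3151 attatttagtgtaacaaacggtacagctgtgacaaagagtatgaacccat
genomic_hu    1 --------------------------------------------------

cDNA_rat   3201 ccatagcaagcaaaatagaagatacaaccaaccaaaacccaatcattatc
genomic_hu    1 --------------------------------------------------

cDNA_rat   3251 tttccatcagtagctgaaattcgagattctgctcaggcaggaagagcatc
genomic_hu    1 --------------------------------------------------

cDNA_rat   3301 ttcccaaagtgcacaccctgtaacagggggaaacatggctacctatggcc
genomic_hu    1 --------------------------------------------------

cDNA_rat   3351 ataccaacacatatagtagctttaccagcaaagccagtacagtcttgcag
genomic_hu    1 --------------------------------------------------

cDNA_rat   3401 ccaataaatccaacagaaagttatggacctcagatacctattacaggagt
genomic_hu    1 --------------------------------------------------

cDNA_rat   3451 cagcagacctagcagtagtgacatctcttctcacactactgcagacccta
genomic_hu    1 --------------------------------------------------

cDNA_rat   3501 gcttctccagtcacccttcaggttcacacaccactgcctcgtctttattt
genomic_hu    1 --------------------------------------------------

cDNA_rat   3551 cacattcctagaaacaacaatacaggtaacttccccttgtccaggcactt
genomic_hu    1 --------------------------------------------------

cDNA_rat   3601 gggaagagagaggacaatttggagcagagggagagttaaaaacccacata
genomic_hu    1 --------------------------------------------------

cDNA_rat   3651 gaaccccagttctccgacggcatagacacaggactgtgaggccagcaatc
genomic_hu    1 --------------------------------------------------

cDNA_rat   3701 aagggacctgctaacaaaaatgtgagccaagttccagccacagagtaccc
genomic_hu    1 --------------------------------------------------

cDNA_rat   3751 tgggatgtgccacacatgtccttccgcagaggggctcacagtggctactg
genomic_hu    1 --------------------------------------------------

cDNA_rat   3801 cagcactgtcagttccaagttcatcccacagtgccctccccaaaactaat
genomic_hu    1 --------------------------------------------------

cDNA_rat   3851 aatgttggggtcatagcagaagagtctaccactgtggtcaagaaaccact
genomic_hu    1 --------------------------------------------------

cDNA_rat   3901 gttactatttaaggacaaacaaaatgtagatattgagataataacaacca
genomic_hu    1 --------------------------------------------------

cDNA_rat   3951 ctacaaaatattccggaggggaaagtaaccacgtgattcctacggaagca
genomic_hu    1 --------------------------------------------------

cDNA_rat   4001 agcatgacttctgctccaacatctgtatccctggggaaatctcctgtaga
genomic_hu    1 --------------------------------------------------

cDNA_rat   4051 caatagtggtcacctgagcatgcctgggaccatccaaactgggaaagatt
genomic_hu    1 --------------------------------------------------

cDNA_rat   4101 cagtggaaacaacaccacttcccagccccctcagcacaccctcaatacca
genomic_hu    1 --------------------------------------------------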
cDNA_rat   4151 acaagcacaaaattctcaaagaggaaaactcccttgcaccagatctttgt
genomic_hu    1 --------------------------------------------------

cDNA_rat   4201 aaataaccagaagaaggaggggatgttaaagaatccatatcaattcggtt
genomic_hu    1 --------------------------------------------------

cDNA_rat   4251 tacaaaagaacccagccgcaaagcttcccaaaatagctcctcttttaccc
genomic_hu    1 --------------------------------------------------

cDNA_rat   4301 acaggtcagagttccccctcagattctacaactctcttgacaagtccgcc
genomic_hu    1 --------------------------------------------------

cDNA_rat   4351 accagctctgtctacaacaatggctgccactcagaacaagggcactgaag
genomic_hu    1 --------------------------------------------------

cDNA_rat   4401 tagtatcaggtgccagaagtctctcagcagggaagaagcagcccttcacc
genomic_hu    1 --------------------------------------------------

cDNA_rat   4451 aactcctctccagtgcttcctagcaccataagcaagagatctaatacatt
genomic_hu    1 --------------------------------------------------

cDNA_rat   4501 aaacttcttgtcaacggaaacccccacagtgacaagtcctactgctactg
genomic_hu    1 --------------------------------------------------

cDNA_rat   4551 catctgtcattatgtctgaaacccaacgaacaagatccaaagaagcaaaa
genomic_hu    1 --------------------------------------------------

cDNA_rat   4601 gaccaaataaaggggcctcggaagaacagaaacaacgcaaacaccacccc
genomic_hu    1 --------------------------------------------------

cDNA_rat   4651 caggcaggtttctggctatagtgcatactcagctctaacaacagctgata
genomic_hu    1 --------------------------------------------------

cDNA_rat   4701 cccccttggctttcagtcattccccacgacaagatgatggtggaaatgta
genomic_hu    1 --------------------------------------------------

cDNA_rat   4751 agtgcagttgcttatcactcaacaacctctcttctggccataactgaact
genomic_hu    1 --------------------------------------------------

cDNA_rat   4801 gtttgagaagtacacccagactttgggaaatacaacagctttggaaacaa
genomic_hu    1 --------------------------------------------------

cDNA_rat   4851 cgttgttgagcaaatcacaggagagtaccacagtgaaaagagcctcagac
genomic_hu    1 --------------------------------------------------

cDNA_rat   4901 acaccaccaccactcctcagcagtggggcgcccccagtgcccactccttc
genomic_hu    1 --------------------------------------------------

cDNA_rat   4951 cccacctccttttactaagggtgtggttacagacagcaaagtcacatcag
genomic_hu    1 --------------------------------------------------

cDNA_rat   5001 ctttccagatgacgtcaaatagagtggtcaccatatatgaatcttcaagg
genomic_hu    1 --------------------------------------------------

cDNA_rat   5051 cacaatacagatctgcagcaaccctcagcagaggctagccccaatcctga
genomic_hu    1 --------------------------------------------------

cDNA_rat   5101 gatcataactggaaccactgactctccctctaatctgtttccatccactt
genomic_hu    1 --------------------------------------------------

cDNA_rat   5151 ctgtgccagcactaagggtagataaaccacagaattctaaatggaagccc
genomic_hu    1 --------------------------------------------------

cDNA_rat   5201 tctccctggccagaacacaaatatcagctcaagtcatactccgaaaccat
genomic_hu    1 --------------------------------------------------
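cDNA_rat   5251 tgagaagggcaaaaggccagcagtaagcatgtccccccacctcagccttc
genomic_hu    1 --------------------------------------------------

cDNA_rat   5301 cagaggccagcactcatgcctcacactggaatacacagaagcatgcagaa
genomic_hu    1 --------------------------------------------------

cDNA_rat   5351 aagagtgtttttgataagaaacctggtcaaaacccaacttccaaacatct
genomic_hu    1 --------------------------------------------------

cDNA_rat   5401 gccttacgtctctctacctaagactctattgaaaaagccaagaataattg
genomic_hu    1 --------------------------------------------------

cDNA_rat   5451 gaggaaaggctgcaagctttacagttccagctaattcagacgtttttctt
genomic_hu    1 --------------------------------------------------

cDNA_rat   5501 ccttgtgaggctgttggagacccactgcccatcatccactggaccagagt
genomic_hu    1 --------------------------------------------------

cDNA_rat   5551 ttcatcagganttgaaatatcccaagggacacagaaaagccggttccacg
genomic_hu    1 --------------------------------------------------

cDNA_rat   5601 tgcttcccaatggcaccttgtccatccagagggtcagtattcaggaccgt
genomic_hu    1 --------------------------------------------------

cDNA_rat   5651 ggacagtacctgtgctctgcatttaatccactgggcgtagaccattttca
genomic_hu    1 --------------------------------------------------

cDNA_rat   5701 tgtctctttgtctgtggttttttacccggcaaggattttggacagacatg
genomic_hu    1 --------------------------------------------------

cDNA_rat   5751 tcaaggagatcacagttcactttggaagtactgtggaactaaagtgcaga
genomic_hu    1 --------------------------------------------------

cDNA_rat   5801 gtggagggtatgccgaggcctacggtttcctggatacttgcaaaccaaac
genomic_hu    1 --------------------------------------------------

cDNA_rat   5851 ggtggtctcagaaacggccaagggaagcagaaaggtctgggtaacacctg
genomic_hu    1 --------------------------------------------------

cDNA_rat   5901 atggaacattgatcatctataatctgagtctttatgatcgtggtttttac
genomic_hu    1 --------------------------------------------------

cDNA_rat   5951 aagtgtgtggccagcaacccatctggccaggattcactgttggttaagat
genomic_hu    1 --------------------------------------------------

cDNA_rat   6001 acaagtcatcacagctccccctgtcattatagagcaaaagaggcaagcca
genomic_hu    1 --------------------------------------------------

cDNA_rat   6051 tcgttggggttttaggtggaagtttgaaactgccctgcactgcaaaagga
genomic_hu    1 --------------------------------------------------

cDNA_rat   6101 actccccagcctagtgttcactgggtcctttatgatgggactgaactaaa
genomic_hu    1 --------------------------------------------------

cDNA_rat   6151 accattgcagttgactcattccagatttttcttgtatccaaatggaactc
genomic_hu    1 --------------------------------------------------

cDNA_rat   6201 tgtatataagaagcatcgctccttcagtgaggggcacttatgagtgcatt
genomic_hu    1 --------------------------------------------------

cDNA_rat   6251 gccaccagctcctcaggctcagagagaagggtagtgattcttactgtgga
genomic_hu    1 --------------------------------------------------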
rat


7351 


CttTGGAAGCAGAGTCACGGTCCATCCAAAT^ 


9 






890 


C t aT GGAAGGAGAAxCACAGTCCATAAAAaT GGAACCTTGGAaa t ta gg a 






rat 


7401 


acatccGGCTTTCTGACTCTGCGGaCTTCACCTGTGTGGttcggagcgag 


*- 






940 


atgtgaGGCTTTCAGATTCAGCCGACTTTATCTGTGTWcccgaasttgaa 






:at 


7451 


ggaGGAGAGAGTGTGTTGGTAQTGCAGTTAGAAGTCCTAGAAATGCTGAG 


& 




.cjbu 


990 


ggtGGAGAGAGCGTGTTGGTAGTACAGTTAGAACT 






rat 


7501 


AAGACCAACATTC^GAAACCCATTCAACGAAAAag±catcgcccaagctg 


t 






J.04U 


AAGACCGAC AT TT AGAfUVl L/UA. 1 J. iAAl ^AAAAaauaguuyCCCa.gc uy y 






rat 


7551 


gcaagcccgtJUK^CTGAACTGCTCTGTGGATGGGAACCCCCCACCTG^ 


ft 
\ 




,c_lm 


1090 


gaaagtccacAGCATTGAATTGCTCTGTTGATGGT^ 






at 


7601 


a 1 1 a cCTGGATG TTACCTGACGGCACACAGTTTGCTAACAGACCAceca a 


V 

r 




,qjta 


1140 


ataatCTGGATTCTACC&AATGGCACACGATTTO 






rat 


7651 


ttccccGTATCTGATGGGAGGCAATGGCTCTCTCATCCTTTACAAAgcaa 


-< 






1190 


ttatcaGTATCTGATAGCAAGCAATGGTTCTTTTATCATTTCTAAAacaa 






rat 


7701 


Ct cggaac^agtCAGGGAAGTATCGCTGTGCAGCCAGGAATAAGGTTGGC 


p 




.cjra 


1240 


ctcgggaggatgCAGGAAAATATCGCTGTGCAGCTAGGAATAAAGTTGGC 




13 


rat 


7751 


TAcat:cgagaaactcatcctgttagagATTGGG»GAAGCCAGTCATTCT 




111 




1290 


TAtattgagaaattagtcatattagaaATTGGCCAGAAGCCAGTTATTCT 






rat 


7601 


GACATACGAACCAGGGAtggtgaagagcgTCAGTGGGGAACCGTTATCAC 




JSC. 




1340 


TACCTAT GGACCAGGGAoag r aaaaggoaTUAt? rcstyAWATui ui ai uau 




*,„. 


rat 


7851 


TGCATTGTGTGTCTGATGGgATCCCCAAGCCAAATGTC^GTGGACTAOT 


1 




*C TO 


* ion 








rat 


7901 


CCGGGTGGccATGTAATCGACAGGCCTCAAGTGGATGGAAAATACATAcT 


c 
c 


(4: 




1440 


firm. * (■"•ni V a mffl** f^Jl/* iftVf^flV** * R iPfll X %fP/'2S2/2'R Hfc(Pfcr , 2iT2l+*'P 
CGAAGTGGtl^TGTAGTAt^CAGGCUTw uCffvsAAAl AUA1 At A 




W 


rat 


7951 


gcmgaaaatggcacxk;tggtcatcaaag^ 


\ 
\ 




L©J«* 


1490 


GCATGACAATGGCiWCTTAGTCAl XAAAGftA(awyiw\*^- ttar gacagH^ 






:at 


8001 


GAAATTATATCTGTAGGGCTCAAAACAGTGTTGGCCAggcagttattago 


I 






1540 


GAAACTAT ATCTGT AAGGCTCAAAATAlb 1 fa I i w X UAta c a c v gaixac t 


1 




rat 


8051 


gtgtCAGTGATGGTTGTGGCCTACCCTCCCCGAATCATAAActacctACC 


1 




•c_hu 


1590 


gt t CCAGTAAXGAa iif 1 Avv^ 1 At#k>L» rL^vwin 1 Invnnn VOy fc.ccziv»A* 






rat 


8101 


(^GGAACATGCTCAGGAGGACAGGGGAAGCCatgCAGCTCCACTG 






,C hu 


1640 


CAGGAGT ATT GT CAC vAG<^C At^i2»<5Wi^<JCt: u H o AkjC i m*ax- i^ivji w 






rat 


eisi 


CCTTGGGAATCCCCAAGCCAaAAffPCACCTGGGAGACGCCAAGAGACTCC 






.c fan 


1690 








rat 


8201 


CTGCTCTCAAaagcaacagcaagaaaacccCATAGAAGTGAGATGCTTCA 








1740 


CTTCTCTCAAcggcaagtaaagagaggaoaCATGGAAGTGAGCAGCTTCA 






rat 


B251 


CXX^ACAAGGTACGCTgGTCATTC^GAATCTCCAAACCTCGGATTCCGGag 






.ojbu 


1790 


CTTACAAGGTACCCTaGTCATTCAGAATCCCCAAACCTCCGATTCTGGga 






rat 


8301 


tcTATAAGTGCAGAGCTCAGAACCTACTTGGgacTGATTACGCAACAACT 








1840 


taTACAAATGCACAGCAAAGAACCCACr'TGGtagTGATTATGCAGCAACG 






rat 


8351 


TACATCCAGGTACTCTGACAGGAAgggggagactaaaattoaaoagaagt 






.a hu 


1890 


TATATTCAAGTAATCTGACATGAAataataaagtcaacaacatctgggca 
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cDNA_rat 8401 ccacatccacagggTTTATTTTTTGGAAGMGTTTAATCAAAGGCAGCCA 
genomic_hu 1940 gaa -TTTATTTTTTGGAAGAAGTTTi^TCAMGGCAGCCA 

cDNA_rat 8451 TAGGCATGTAAATGAgtcTGMTACATTTACAGTAT TAAATTTACAAT GG 
genomic_hu 1979 TAGGCATGTAAATGAattTGAMA(^Tl!!PACAGTATTAAATTTAC^TGA 

cDNA_rat 8501 ACATGCgatga GACTTGTAAATGAAACCATTGTGAACTGAaaccg 
genomic_hu 2029 ACATGCaaaataaaagGACTTGTAAATAMTGCATTATGW^GAtgata 

cDNA_rat 8546 agtctctg TGWTCTCAAAGCAAACTCTTAACTTAAGGCACTXTg 
genomic_hu 2079 ctgatttatttaaTGGATCTCAAAACAAACTTTrAACTTAAGGCACTTTt 

cDNA_rat 8591 ATTCTrcCAAO^TAAT^^ 
genomic_hu 2129 ATTTTGCCAACAAATAACAATAAACAaacattgaaacggtt CACTA 

cDNA_rat 8641 CGAAATAACAAACGGCTAATGCACCTGAATTctcagtaaaaagacctttc 
genomic_hu 2175 TAAAATAACAAATGGCTAATGXACCTGAATTtttcagtaaaaaaatgaac 

cDNA_rat 8691 tctcgctaaoagttgCCAGCTGCCTCGTOTCTGTTTCCTACCAATGTCAC 
genomic_hu 2225 ttctaata CCAGTTGCCTAGTGTCCACCTCCTATCAATGTTAC 

cDNA_rat 8741 AAACAtcgcacacagggtGAATGGAGTCAACGGGAAAGATTAAGTTTGCG 
genomic_hu 2268 AAGCAtggcactca GAACAGAGACAATGGAAAAT ATTAAATCT GCA 

cDNA_rat 8791 GTCtgtgtaaatctca ATGTACAAATATTCTGtcncTG 
genomic_hu 2314 ATCtttatgatgtaaatttaccatcctgATGTATAAATATTTTG TG 

exonl (2342-2397) 

cDNA_rat 8791 GTCtgtgtaaatctca ATGTACAAATATTCTGtcncTG 
genomic_hu 2314 ATCtttatgatgtaaatttaecatccfcgATGTATAAATATTTTG^ TG 

cDNA_rat 8829 GTTTATAAACATTTTGATAAAACCGAAAAAAAAAAAAAaaaaaaaaaaaa 
genomic_hu 2360 GTTTATAAATTTTTTTGCTAAAACCTACAGAAAATAAGGactgaactgtc 

(Genomic human OCP: SEQ ID NO:4) 
(cDNA_rat: SEQ ID NO:5) 
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Figure 11 



Exon/Intron   Exon     Exon     Exon     Intron
No.           start    end      length   length   Remarks

1             1        208      208      69       No valid splice site found upstream this exon
2             277      429      153      18
3             447      485      39       1561
4             2046     2244     199      1351
5             3595     3724     130      3254
6             6978     7359     382      4123
7             11482    14903    3422     38
8             14941    15307    367      51
9             15358    15825    468      1039
10            16864    -17760   -897     ?        Last exon might be complete
11            ?(1)     2317     -2317    25       Exon is not complete and start site is not known
12            2342     2397     56
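
The arithmetic behind the exon/intron table can be checked with a short sketch. It assumes only the start and end coordinates listed for exons 1 to 9 and the listed start of exon 10; the names `EXONS`, `exon_length` and `gap_lengths` are hypothetical and introduced here for illustration. In the table as listed, the intron length column equals the next exon start minus the current exon end, and the sketch follows that convention.

```python
# Exon coordinates (start, end) for exons 1-9, copied from the table.
EXONS = [
    (1, 208), (277, 429), (447, 485), (2046, 2244), (3595, 3724),
    (6978, 7359), (11482, 14903), (14941, 15307), (15358, 15825),
]
NEXT_START = 16864  # listed start of exon 10


def exon_length(start, end):
    """Inclusive length of an exon given its first and last position."""
    return end - start + 1


def gap_lengths(exons, final_start):
    """Next exon start minus current exon end, the convention the table uses."""
    next_starts = [start for start, _ in exons[1:]] + [final_start]
    return [nxt - end for (_, end), nxt in zip(exons, next_starts)]


lengths = [exon_length(start, end) for start, end in EXONS]
gaps = gap_lengths(EXONS, NEXT_START)
print(lengths)
print(gaps)
```

Run as written, the two printed lists reproduce the exon length and intron length columns for rows 1 to 9.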







608 Human translated nucleotide sequence (ORF) 

ATGAAGGTAAAAGGCAGAGGAATCACCTGCTTGCTGGTCTCCTTTGCTGT 

GATCTGCCTGGTCGCCACCCCTGGGGGCAAGGCCTGTCCTCGCCGCTGTG 

CCTGTTATATGCCTACGGAGGTACACTGCACATTTCGGTACCTGACTTCCA 

TCCCAGACAGCATCCCGCCCAATGTGGAACGCATCAATTTAGGATACAAC 

AGCTTGGTTAGATTGATGGAAACAGATTTTTCTGGCCTGACCAAACTGGA 

GTTACTCATGCTTCACAGCAATGGCATTCACACAATCCCTGACAAGACCTT 

CTCAGATTTGCAGGCCTTGCAGGTCTTAAAAATGAGCTATAATAAAGTCC 

GAAAACTTCAGAAAGATACTTTTTATGGCCTCAGGAGCTTGACACGATTG 

CACATGGACCACAACAATATTGAGTTTATAAACCCAGAGGTTTTTTATGG 

GCTCAACTTTCTCCGCCTGGTGCACTTGGAAGGAAATCAGCTCACTAAGCT 

CCACCCAGATACATTTGTCTCTTTGAGCTACCTCCAGATATTTAAAATCTC 

TTTCATTAAGTTCCTATACTTGTCTGATAACTTCCTGACCTCCCTCCCTCAA 

GAGATGGTCTCCTATATGCCTGACCTAGACAGCCTTTACCTGCATGGAAA 

CCCATGGACCTGTGATTGCCATTTAAAGTGGTTGTCTGACTGGATACAGCC 

AGATGTAATAAAATGCAAAAAAGATAGAAGTCCCTCTAGTGCTCAGCAGT 

GTCCACTTTGCATGAACCCTAGGACTTCTAAAGGCAAGCCGTTAGCTATG 

GTCTCAGCTGCAGCTTTCCAGTGTGCCAAGCCAACCATTGACTCATCCCTG 

AAATCAAAGAGCCTGACTATTCTGGAAGACAGTAGTTCTGCTTTCATCTCT 

CCCCAAGGTTTCATGGCACCCTTTGGCTCCCTCACTTTGAATATGACAGAT 

CAGTCTGGAAATGAAGCTAACATGGTCTGCAGTATTCAAAAGCCCTCAAG 

GACATCACCCATTGCATTCACTGAAGAAAATGACTACATCGTGCTAAATA 

CTTCATTTTCAACATTTTTGGTGTGCAACATAGATTACGGTCACATTCAGC 

CAGTGTGGCAAATTTTGGCTTTGTACAGTGATTCTCCTCTGATACTAGAAA 

GGAGCCACTTGCTTAGTGAAACACCGCAGCTCTATTACAAATATAAACAG 

GTGGCTCCTAAGCCTGAAGACATTTTTACCAACATAGAGGCAGATCTCAG 

AGCAGATCCCTCTTGGTTAATGCAAGACCAAATTTCCTTGCAGCTGAACA 

GAACTGCCACCACATTCAGTACATTACAGATCCAGTACTCCAGTGATGCT 

CAAATCACTTTACCAAGAGCAGAGATGAGGCCAGTGAAACACAAATGGA 

CTATGATTTCAAGGGATAACAATACTAAGCTGGAACATACTGTCTTGGTA 

GGTGGAACCGTTGGCCTGAACTGCCCAGGCCAAGGAGACCCCACCCCACA 

CGTGGATTGGCTTCTAGCTGATGGAAGTAAAGTGAGAGCCCCTTATGTCA 

GTGAGGATGGACGGATCCTAATAGACAAAAGTGGAAAATTGGAACTCCA 

GATGGCTGATAGTTTTGACACAGGCGTATATCACTGTATAAGCAGCAATT 

ATGATGATGCAGATATTCTCACCTATAGGATAACTGTGGTAGAACCTTTGG 

TCGAAGCCTATCAGGAAAATGGGATTCATCACACAGTTTTCATTGGTGAA 

ACACTTGATCTTCCATGCCATTCTACTGGTATCCCAGATGCCTCTATTAGC 

TGGGTTATTCCAGGAAACAATGTGCTCTATCAGTCATCAAGAGACAAGAA 

AGTTCTAAACAATGGCACATTAAGAATATTACAGGTCACCCCGAAAGACC 

AAGGTTATTATCGCTGTGTGGCAGCCAACCCATCAGGGGTTGATTTTTTGA 

TTTTCCAAGTTTCAGTCAAGATGAAAGGACAAAGGCCCTTGGAGCATGAT 

GGAGAAACAGAGGGATCTGGACTTGATGAGTCCAATCCTATTGCTCATCT 

TAAGGAGCCACCAGGTGCACAACTCCGTACATCTGCTCTGATGGAGGCTG 

AGGTTGGAAAACACACCTCAAGCACAAGTAAGAGGCACAACTATCGGGA 

ATTAACACTCCAGCGACGTGGAGATTCAACACATCGACGTTTTAGGGAGA 

ATAGGAGGCATTTCCCTCCCTCTGCTAGGAGAATTGACCCACAACATTGG 

GCGGCACTGTTGGAGAAAGCTAAAAAGAATGCTATGCCAGACAAGCGAG 

AAAATACCACAGTGAGCCCACCCCCAGTGGTCACCCAACTCCCAAACATA 

CCTGGTGAAGAAGACGATTCCTCAGGCATGCTCGCTCTACATGAGGAATT 



TATGGTCCCGGCCACTAAAGCTTTGAACCTTCCAGCAAGGACAGTGACTG 

CTGACTCCAGAACAATATCTGATAGTCCTATGACAAACATAAATTATGGC 

ACAGAATTCTCTCCTGTTGTGAATTCACAAATACTACCACCTGAAGAACCC 

ACAGATTTCAAACTGTCTACTGCTATTAAAACTACAGCCATGTCAAAGAA 

TATAAACCCAACCATGTCAAGCCAAATACAAGGCACAACCAATCAACATT 

CATCCACTGTCTTTCCACTGCTACTTGGAGCAACTGAATTTCAGGACTCTG 

ACCAGATGGGAAGAGGAAGAGAGCATTTCCAAAGTAGACCCCCAATAAC 

AGTAAGGACTATGATCAAAGATGTCAATGTCAAAATGCTTAGTAGCACCA 

CCAACAAACTATTATTAGAGTCAGTAAATACCACAAATAGTCATCAGACA 

TCTGTAAGAGAAGTGAGTGAACCCAGGCACAATCACTTCTATTCTCACAC 

TACTCAAATACTTAGCACCTCCACGTTCCCTTCAGATCCACACACAGCTGC 

TCATTCTCAGTTTCCGATCCCTAGAAATAGTACAGTTAACATCCCGCTGTT 

CAGACGCTTTGGGAGGCAGAGGAAAATTGGCGGAAGGGGGCGGATTATC 

AGCCCATATAGAACTCCAGTTCTGCGACGGCATAGATACAGCATTTTCAG 

GTCAACAACCAGAGGTTCTTCTGAAAAAAGCACTACTGCATTCTCAGCCA 

CAGTGCTCAATGTGACATGTCTGTCCTGTCTTCCCAGGGAGAGGCTCACCA 

CTGCCACAGCAGCATTGTCTTTTCCAAGTGCTGCTCCCATCACCTTCCCCA 

AAGCTGACATTGCTAGAGTCCCATCAGAAGAGTCTACAACTCTAGTCCAG 

AATCCACTATTACTACTTGAGAACAAACCCAGTGTAGAGAAAACAACACC 

CACAATAAAATATTTCAGGACTGAAATTTCCCAAGTGACTCCAACTGGTG 

CAGTCATGACATATGCTCCAACATCCATACCCATGGAAAAAACTCACAAA 

GTAAACGCCAGTTACCCACGTGTGTCTAGCACCAATGAAGCTAAAAGAGA 

TTCAGTGATTACATCGTCACTTTCAGGTGCTATCACCAAGCCACCAATGAC 

TATTATAGCCATTACAAGGTTTTCAAGAAGGAAAATTCCCTGGCAACAGA 

ACTTTGTAAATAACCATAACCCAAAAGGCAGATTAAGGAATCAACATAAA 

GTTAGTTTACAAAAAAGCACAGCTGTGATGCTTCCTAAAACATCTCCTGCT 

TTACCACAGAGACAAAGTTCCCCTTTCCATTTCACCACACTTTCAACAAGT 

GTGATGCAAATTCCATCTAATACCTTGACTACCGCTCACCACACTACGACC 

AAAACACACAATCCTGGAAGTCTTCCAACAAAGAAGGAGCTTCCCTTCCC 

ACCCCTTAACCCTATGCTTCCTAGTATTATAAGCAAAGACTCAAGTACAA 

AAAGCATCATATCAACGCAAACAGCAATACCAGCAACAACTCCTACCTTC 

CCTGCATCTGTCATCACTTATGAAACCCAAACAGAGAGATCTAGAGCACA 

AACAATACAAAGAGAACAGGAGCCTCAAAAGAAGAACAGGACTGACCCA 

AACATCTCTCCAGACCAGAGTTCTGGCTTCACTACACCCACTGCTATGACA 

CCTCCTGCTCTGGCATTCACTCATTCCCCACCAGAAAACACAACTGGGATT 

TCAAGCACAATCAGTTTTCATTCAAGAACTCTTAATCTGACAGATGTGATT 

GAAGAACTAGCCCAAGCAAGTACTCAGACTTTGAAGAGCACAATTGCTTC 

TGAAACAACTTTGTCCAGCAAATCACACCAGAGTACCACAACTAGGAAAG 

CATCATTAGACACTCCCATACCACCATTCTTGAGCAGCAGTGCTACTCTAA 

TGCCAGTTCCCATCTCCCCTCCCTTTACTCAGAGAGCAGTTACTGACACAC 

GTGGCGACTCCCATTTCCGGCTTATGACAAATACAGTGGTCAAGCTGCAC 

GAATCCTCAAGGCACAATCTCCAAATGCCAAGTTCACAATTGGAACCACT 

CACTTCATCTACCTCTAATCTGTTACATTCTACTCCCATGCCAGCACTAAC 

AACAGTTAAATCACAGAATTCCAAATTAACTCCATCTCCCTGGGCAGAAT 

ACCAATTTTGGCACAAACCATACTCAGACATTGCTGAAAAAGGCAAAAAG 

CCAGAAGTAAGCATGTTGGCTACTACAGGCCTGTCCGAGGCCACCACTCT 

TGTTTCAGATTGGGATGGACAGAAGAACACAAAGAAGAGTGACTTTGATA 

AGAAACCAGTTCAAGAAGCAACAACTTCCAAACTCCTTCCCTTTGACTCTT 

TGTCTAGGTATATATTTGAAAAGCCCAGGATAGTTGGAGGAAAAGCTGCA 

AGTTTTACTATTCCAGCTAACTCAGATGCCTTTCTTCCCTGTGAAGCTGTT 



GGAAATCCCCTGCCCACCATTCATTGGACCAGAGTTTCAGGACTTGATTTA 

TCTAGAGGAAACCAGAATAGCAGGGTCCAGGTTCTCCCCAATGGTACCCT 

GTCCATCCAGAGGGTGGAAATTCAGGACCGCGGACAGTACTTGTGTTCCG 

CATCCAATCTGTTTGGCACAGACCACCTTCATGTCACCTTGTCTGTGGTTT 

CCTATCCTCCCAGGATCCTGGAGAGACGTACCAAAGAGATCACAGTTCAT 

TCCGGAAGCACTGTGGAACTGAAGTGCAGAGCAGAAGGTAGGCCAAGCC 

CTACAGTTACCTGGATTCTTGCAAACCAAACAGTTGTCTCAGAATCATCCC 

AGGGAAGTAGGCAGGCTGTGGTGACGGTTGACGGAACATTGGTCCTCCAC 

AATCTCAGTATTTATGACCGTGGCTTTTACAAATGTGTGGCCAGCAACCCA 

GGTGGCCAGGATTCACTGCTGGTTAAAATACAAGTCATTGCAGCACCACC 

TGTTATTCTAGACCAAAGGAGGCAAGTCATTGTAGGCACTTGGGGTGAAA 

GTTTAAAACTGCCCTGTACTGCAAAAGGAACTCCTCAGCCCAGCGTTTACT 

GGGTCCTCTCTGATGGCACTGAAGTGAAACCATTACAGTTTACCAATTCCA 

AGTTGTTCTTATTTTCAAATGGGACTTTGTATATAAGAAACCTAGCCTCTT 

CAGACAGGGGCACTTATGAATGCATTGCTACCAGTTCCACTGGTTCGGAG 

CGAAGAGTAGTAATGCTTACAATGGAAGAGCGAGTGACCAGCCCCAGGA 

TAGAAGCTGCATCCCAGAAAAGGACTGAAGTGAATTTTGGGGACAAATTA 

CTACTGAACTGCTCAGCCACTGGGGAGCCCAAACCCCAAATAATGTGGAG 

GTTACCATCCAAGGCTGTGGTCGACCAGTGGAGCTGGATCCACGTCTACC 

CTAATGGATCCCTGTTTATTGGATCAGTAACAGAAAAAGACAGTGGTGTC 

TACTTGTGTGTGGCAAGAAACAAAATGGGGGATGATCTGATACTGATGCA 

TGTTAGCCTAAGACTGAAACCTGCCAAAATTGACCACAAGCAGTATTTTA 

GAAAGCAAGTGCTCCATGGGAAAGATTTCCAAGTAGATTGCAAAGCTTCC 

GGCTCCCCAGTGCCAGAGATATCTTGGAGTTTGCCTGATGGAACCATGAT 

CAACAATGCAATGCAAGCCGATGACAGTGGCCACAGGACTAGGAGATAT 

ACCCTTTTCAACAATGGAACTTTATACTTCAACAAAGTTGGGGTAGCGGA 

GGAAGGAGATTATACTTGCTATGCCCAGAACACCCTAGGGAAAGATGAA 

ATGAAGGTCCACTTAACAGTTATAACAGCTGCTCCCCGGATAAGGCAGAG 

TAACAAAACCAACAAGAGAATCAAAGCTGGAGACACAGCTGTCCTTGACT 

GTGAGGTCACTGGGGATCCCAAACCAAAAATATTTTGGTTGCTGCCTTCC 

AATGACATGATTTCCTTCTCCATTGATAGGTACACATTTCATGCCAATGGG 

TCTTTGACCATCAACAAAGTGAAACTGCTCGATTCTGGAGAGTACGTATG 

TGTAGCCCGAAATCCCAGTGGGGATGACACCAAAATGTACAAACTGGATG 

TGGTCTCTAAACCTCCATTAATCAATGGTCTGTATACAAACAGAACTGTTA 

TTAAAGCCACAGCTGTGAGACATTCCAAAAAACACTTTGACTGCAGAGCT 

GAAGGGACACCATCTCCTGAAGTCATGTGGATCATGCCAGACAATATTTT 

CCTCACAGCCCCATACTATGGAAGCAGAATCACAGTCCATAAAAATGGAA 

CCTTGGAAATTAGGAATGTGAGGCTTTCAGATTCAGCCGACTTTATCTGTG 

TGGCCCGAAATGAAGGTGGAGAGAGCGTGTTGGTAGTACAGTTAGAAGTA 

CTGGAAATGCTGAGAAGACCGACATTTAGAAATCCATTTAATGAAAAAAT 

AGTTGCCCAGCTGGGAAAGTCCACAGCATTGAATTGCTCTGTTGATGGTA 

ACCCACCACCTGAAATAATCTGGATTTTACCAAATGGCACACGATTTTCCA 

ATGGACCACAAAGTTATCAGTATCTGATAGCAAGCAATGGTTCTTTTATCA 

TTTCTAAAACAACTCGGGAGGATGCAGGAAAATATCGCTGTGCAGCTAGG 

AATAAAGTTGGCTATATTGAGAAATTAGTCATATTAGAAATTGGCCAGAA 

GCCAGTTATTCTTACCTATGCACCAGGGACAGTAAAAGGCATCAGTGGAG 

AATCTCTATCACTGCATTGTGTGTCTGATGGAATCCCTAAGCCAAATATCA 

AATGGACTATGCCAAGTGGTTATGTAGTAGACAGGCCTCAAATTAATGGG 

AAATACATATTGCATGACAATGGCACCTTAGTCATTAAAGAAGCAACAGC 

TTATGACAGAGGAAACTATATCTGTAAGGCTCAAAATAGTGTTGGTCATA 



CACTGATTACTGTTCCAGTAATGATTGTAGCCTACCCTCCCCGAATTACAA 

ATCGTCCACCCAGGAGTATTGTCACCAGGACAGGGGCAGCCTTTCAGCTC 

CACTGTGTGGCCTTGGGAGTTCCCAAGCCAGAAATCACATGGGAGATGCC 

TGACCACTCCCTTCTCTCAACGGCAAGTAAAGAGAGGACACATGGAAGTG 

AGCAGCTTCACTTACAAGGTACCCTAGTCATTCAGAATCCCCAAACCTCC 

GATTCTGGGATATACAAATGCACAGCAAAGAACCCACTTGGTAGTGATTA 

TGCAGCAACGTATATTCAAGTAATCTGA 
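
The relationship between this open reading frame and the human amino acid sequence shown in the later figures can be illustrated with a minimal sketch. It assumes the standard genetic code; the names `CODON_TABLE`, `translate` and `ORF_START` are hypothetical, and only the first 30 nucleotides of the ORF are used.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate from the first base in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" marks an unreadable codon
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First 30 nucleotides of the ORF as printed above.
ORF_START = "ATGAAGGTAAAAGGCAGAGGAATCACCTGC"
print(translate(ORF_START))  # MKVKGRGITC
```

The ten residues obtained agree with the start of the human sequence in Figure 18. Codons containing characters outside A, C, G and T are reported as `X` rather than guessed.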
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Figure 13 
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Figure 14 



h«nSn_5+3__correcte4 
mus cDNA 5 



huznanj5+3_ corrected 
zeus cDNA 5 



rat_cDNA 

human_5+3_correcte<? 
RtttS cDNA 15 



rat_c0R& 

huaan_5+3^corrected 
mus com ? 



human_5+3_corrected 
mus cDNA 5 



r*t_cDNA 

hufflan_5+3_correcte<l 
mus cDNA 5 



rat_CDl$A 

human__5 +3_correct £d 
raus cDNA 5 



r*t_cDNA 

human_5+3_corrected 
mis cDNA 5 



rat_cDNA 

huit^_5+3^corrected 
mus cDNA 5 



xat_cDNA 

hiHoan_5+3_correctM 
mus cPKA 5 



human^543_corrected 
mus_c0NA_$ 



himan_5+3_correctedl 
tttts cDNA 5 



huaaii_5+3 - corrected 
mas cDKA 5 



CGAGAGACGACAGAAGGTF^^ 



<*XftG<X&AAAC^CGGG^^ 
GCCAGGCiU^G(3aGCGGGftAS^ 

GACGTOTTTCTGC^^ 
GT^TTGATraOTACTTGCAAOTT*^ 



GTGATCX^TTAGTACTTGTAiV31TOAGGrt3A^TT GAGAACTARCTA^CCTATACTA 

T^GAGGG^GAGGAAGAGCA^ 



TTGAG<j<5AGAAGGAAGAGCA r rT CCAGGAGCAGCAGGAAASCTTTGGTTAGT 

TTGGAAATGGATGATAGCA^ 



CCCCCAGCTGAAA9CCAGAAAAGACTAAG 



CCCCCAGCT --a-<stg^aagcctach:aaq2ctt^ 

AAGAACTTTCCTTCCC^^ 



AA^CGTIIXT^C^CAGGTGAAGGCTC^ 

immri — - ~rr ^^aagg^aaaaggo^ggaa 

*±* ** * ****** * * 

TCkGCQGCTTGTTGA^ 
rCACCTGCTXGCTGGTCTCmTG^ 

*** * ***** ***** * *** ********** * ********** **» ** 
CCTOTCCTCGCCGCTCT^ 

************ ******** *** **** * 4 ***** ***** ********* *** 

TGftCCTCCATC03U3^^ 

TG&CTOCATCCC&SACAG- 

T^CCTCCATCCO^CGGGCMCCCA^ 

**** *********** * ****** ************* * ******** ** *** 
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huiaaii_5 4-3_co r r«cted 
mus cDNA 5 



AGCCTCACTAGM^C^CAGAAft^ 

* ******** **** ** *** ******** ** **************** 



rat_c0NA 

huaan_5+3_corrected 
mug c5sa 5 



CTTCAC&GCAATGGCOT 
CTGCACAGCAATGGCATTt^C?^^ 

** ***** ************* * ** *************** * ****** ***** 



ratjCDBA 

hTfl^_5+3_corrected 
mus cP&A 5 



CftG<nKSTAAAAAXGa£^ 

*********************** ******* ** ** *** ******** ** ** 



rat_j6fcfc& 

humn_5+3_corrected 



*** ******** ** ****** **** ******** ***** ** ** ***** *** 



rat_cDtfA 

human_5+3_COrrected 

3ZEOS CDSIA 5 



GCGT!TTTACGGi^CTCACCTTGCTCCGCT 

* ***** ** ** * ** ****** **** ** * ******** * *** ** +** 



husan_5 + 3_co r r ec t ed 



CTCXStfjCCSUSACAC^ 

CTCGaCCa^^ACATTTGTCTCTT^ 

CTCCATCCAGftGaCATTTGTCTCTTT^^ 

***** ***** *********** ** ***** **************** *** ****** 



rat^cBNA. 

husSn_5+3_corrected 
mxa cDNA 5 



*** *** * ***** ********** * ************ **** ********** 



rat_cDN& 

human_5+3_corrected 
rous cDNA ¥ 



CA^GCOWLCCTAGftftAG^^ 
TA£GCCT<^CaiA<^CAGCC^^ 

***** ******* ***** ** ************************* ******** 



rat,_cDNA 

human_5+3_corrected 
mua cDHA 5 



AAA<3T<X3TTGTCTG&CTG^^ 

AAAiCTG<^GTCC<aGX<j<5&^ TTOTTTG 

************ ** ***** **** **** ****** * * 



rat_ cDNA 

hiaESn_5+3 — corrected 
mus cDNA 5 



* * * * * *** * ** *** * * * **** * 



rat_cDNA 

human_5+3__C0JT^^cted 
mus cDNA 5 



Aj(3XGTTAGCTAXGGTCT^^ 

-CCCC^AACCnrcCGCCCCA. — CTTCCCTACCTACCCSVrTC — CCA5*TTTTtGGC 

** * * ** *** ** * * * * ***** 



rat_cB8ft 

h«saau_5+3_cc>rractftd 

WXS- CDNA 5 



CCCTG&A*^»AftjG^ 

CCTGOCATTCCCC-- — — -~ — 

* * * ** 



rat_cDNA 

faraaan_5+3_eorr&cted 
mus eDNR. 5 



AAGATTTCA3!AGAACCCTTTGGCTCCT 



rac_cDNR 

hraoaaJ5+3_correct^d 
yi^ff cUNA 5 



A<3GCC<3AGftT<^CrOTAGXAr^^ 
AfcGCTftACATQGTCTGOU^ 
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htmti_5+3_corrected 
mils cDNA 5 



rat_cDNA 

hu&an_5+3_co rr ected 
mus cDNA_5 



ATTACGGTOtfSUr^^ 



humaix_5+3_coxrect*d 
mus cDNA 5 



TAGIAGJlft&GGAAGCCCC^ 



rat_Ctm 

human_5 +3_correct ed 



TGGCTCCTAAGCCTSAA^ 



rat cDN& 

human^5+ 3_co rxec ted 
mus cDNA 5 



rat^cDSA 

humik_5+3_corrected 
mus cDNA 5 



CaMACAGATCC&GTAC^^ 



tfctjoDNA 

h.uman_5+ 3_co rr«ct ed 
mus cDHA 5 



CGGAGAG^CAAAT^ 



rat_CDtt& 

hu»an^5+5_oorrected 
mus cDNA 5 



rat_cDNA 

human_5+3_corrected 
ans cBftA 5 



rat_cDNZV 

hum£n_5+ 3_correct ed 
mas cDNA 5 



rat_cDNA 

htQQaa_5+3_COrrected 
mus cV8K 5 



GCGTAmXCaC^G^TAfgGCAGCAA^ 



sue cDNA 5 



CTCTGGXftlSaACCTTT^ 



rat_cDNA 

hiaaiaa_5+3jsorr$cted 



C<^GTGAGACGCTCGACCT 



human_5+3^corrected 
mue cMJA S 



G^TTCTTCCAGGOMCACTG^ 
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r*t_cDNA 

hw^_5+3_corrected 
wis cWA 5 



huniaa_5+3 — corrected 
mas cENA 5 



ratjeDNA 
mus cDNA 5 



AftA&SAJNSOTTGft^ 



human_5+3 corrected 



GTGTTTCCCTTMGCAGCCAG^^ 
TTGCTCMCTT^GGASCC&CC^ 



h\saan_5+ 3_corr acted 
was cDNA 5 



A«3TTGGAAAkrac&C^ 



human_5+3_corrected 
saw cDNA 5 



<^GK3<^<3GG<3A£?C^ 
AGCGACGTGGAGATS^^ 



rat^cDNA 

huzaari_5+3_cc>rrec.te<i 
mus cDHZV 5 



CfGCTC G<^G£U\T TGACCC 



rat_cDNA 
sous c5ka 5 



C^T^CAAAAAAGCAAQAA^ACCACAC^ 
CTATGtt^GACAAGCGAG&AAAIA^ 



rat^cBNA 

human_5+3_corrected 



rat_cENA 
20U5 cDNA 5 



TGGTCCCGGCCACnra^^ 



htao5n^5+3_corrected 
heus cStfA 5 



rat_ow*A 

hum*n_5+3_corr^cted 
jaus cSna 5 



CftGaAACAC!3ACMTCS^ 



rat_cD$A 

human_5+3_eorr*cted 
iqizs cDtfA 5 



rat_d>NA 



AAAACCC^'I^^TTATCTrrCCATC- 



AGTAGCX 6AAArrCGAGATTCTG-^T 

AACATTCATCCACTSTCTTTO 
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rat_cDNA 

hmnan_5+3_corrected 
imis CDMA 5 



. casoca — GGAAjG&GCfta^ 

GAGGGfift<3k<3GAA<3*^^ 



hu»anJ5+3_corrected 
crus cDNA S 



ACCTATGGCCMACC^ 

AARG^GinS^^GTCaftaK^^ MSRG 



rat_cD&A 

h\aS\J5+3jC3orre«l:ed 
mud CDNA 5 



AGCCAATAAAffCCAACASAM^^ 

AC^CAOTAAATACCACAAATASTC^T C^CATCTGTMGAS^GTGAGTGAAC 



rat_cD^A 

humafc — 5+ 3_ corrected 
mus CDNA 5 



CC&GGCACAATC&CT!K^^ 



ret_cDNA 

human_5 +3_co r rected 
mug c5na I 



CAGATCCAC&CACAGCTGCTC^ 



rat_cDNA 

&URtanj5+3_<»rrectsd 
mus cDNA 5 



AC^TCCCGCTOTCAGACGCmQGGAGGCA^ 



JPat^cDNA 
CDNA 5 



AAAACCCACOTAGAACXX£AGTTCT 
TC^CCATATAGAACTCCAOT 



rat_cOTA 

2*uman_5+3_eorrected 
unis cDNA 5 



TC^GGGACCTGCT&AjCAft^ 
O^GA(aOTCTTCTGAAfiAft&^ 



rat_cDKA 

ht2n£&_5+3_c6rr*ctfcd 

JSUS CDNA 5 



<3CCArac^^CCTTCC&CA^^ 



rat_cC*3A 

iiunw_5+3_corrected 
ntus cDNA 5 



htzman_5+3_corrected 
mas £D8k 5 



CCACIOTOCjTCAAGAfiACC^ 
CAACTCTAtfTCCAGAATCa^^ 



rat_cDNA 

huroaa_S+3_correctftd 
ams cDNA 5 



mus cDNA 5 



rat_cPNA 

human_5+3_corrected 
mas cDNA 5 
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rat__cDNA 



TTTCA^TGCTATCACCAA^ 



Xat_ct>UA 

huaan_5 + 3_co rr ected 



ra.t_cDNA 

IiunSa^S +3_corrected 
nrus cDNA 5 



rat_cBNA 

htanan_5 +3__corrected 



mas cDNA 5 



CftAATSCCATan^ACOT^ 



rat_eDi*A 

huxaan_5 + 3_co rrected 
inns cDKA 5 



QCX^&AG^O^ TGCXTCC 
CCT GGAAGTCriCCMCAAAGAAGGiUaC^' XCCCTl^-CC^CXCT T2ACCCTATGCTTCC 



rat_cDNft 

humaa_5+3_corrected 
sms cDNA 5 



<3AcmGT<xflAcr<3c^^ 



rat^cENA 
JOUS cDNA 5 



&<3^CaAAAGACC$ftAt^ (&ft^Cft!3&ftAC^ 



rat_ct>Kft 
mus c5t3A 5 



TCCAGACCaGAGTTC^^ 



rat_cDN& 

lniinan_5+3_correct€<i 
mtie cOHA 5 



rat_cDJ?A 

buxnart_5+3__eorr£Ctfcd 
laus cDMA 5 



TTCAAGAACTCTT&ATCT^ 



zatjcDNA 

huioSi_5+3_corrected 



rat^cEm 



AGTG^AAAGAGCCTGA GfcCAC— ACC^CCftCC&CTOCTCAG^ 
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h-uman_5 +3_cor rected 



i^TGCXaGTTCCCfilCTCOCCTCCC^ 



TCCCATTTCCGGOTTTJVX^^ 



ratjCUNA 

hximaii^5+3_co rrected 
sens cDNA 5 



rat_cDNA 

fcvatra_5+3_corr&ot©d 
rous cDNA 5 



xatjoWa 

hWa&_5+3jC#rrected 



ratjcDWa 



-CCTTCCAGAO<3CCAGCACTC^OC^ 



rat_cDNft 

&tun&A_5+3_eorrect©d 
mas cDMA 5 



human_5+3_corrected 



rat_cDNA 

hu«Sn_5+3_oorrected 



&uniS[nJ5+3_corrected 
mus cDNA 5 



ratjcDNR 

hun£n_5+3_coxrected 



rat_cDNA 

human_5+3_corrQcted 



human_5+3_©?«scted 
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kuraan__5+3_co rrected 

HOIS 5 



rat_cDN& 

fcua£n_5+ 3_corrected 



<^TTCTTGCAAACC&AA^ 



rat_cPKA 

hmsai w S+3_corrected 
mils cDKA 5 



mus cDNA 5. 



CAGOVCCACCTGTTAT^ 



pus cDKA 5 



GTTT<^AACT<^CTGCACTGC^^ 
GlTTAAAACTGCCCTGTACTGCft^^^^ 



mis c6na 5 



buman_5+3_correct«d 
rans cDUA 5 



rat__cDNA 

hunan_5+3_corrected 
mis cDNA 5 



rat_cDHA 
mus cDNA 5 



CCAGCCCCA<3<3»TAGZ^^ 



human_5 *-3_co rr ected 
sous ct>NA 5 



rat^cfcNA 

huinan_5+3_c©*rected 



CCAAGGC^TO^GACCAOT(5<5 GCJWaCTGGATOCACGTCTAOCCEAAITG 



r&tjcDNA 



GATCCTTQGTGCTTGGGfTCAGTGACGGAAJi^ 
GAICCCTGTTT&FIGGATCAG^ 
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rat_cDNA 



mas cDNA 5 



ACTGOMGGCCTC^<^CTCCCCTCTG^ 
Z^GCAAAGCrrCCGGCXC^^ 



huaan__5+ 3_co rrac te<i 



rat_cBBl& 

human_5+3_correctea 



ratjCOSA 

kuman_5+3_corrected - 
mis ccm 5 



husRan._5+3_ca rrec ted 
mis cDNA 5 



mus COMA 5 



rat_CBN& 



hvman_5+ 3_corrected 
Bms cDNA. 5 



rat_CDNA 

human_5+3 corrected 
ana cDN& 5 



rat_cDN& 



huraaa_5+3_correcte<l 



human_5+ 3_correcte<3 
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ratjctftR. 
rous cENA 5 



hu»an_5+3_corr©cted 
jnus cDNA 5 



hunSn__5+3 — corrected 



GCTCTGTGGAITGGGAACCC^ 
GCTCTGTTt^GGTAACOCACGZiC^^ 



rat_cDHA 

hunSn._5+3_corrected 
mas oSk^ 5 



TTTCC^TGGACCACAAAGTI2\2^ 



h\ttftan_5-*-3_correctQd 
Mia cDKA 5 



hnman_5+ 3_corrected 
KluS CDNA 5 



xat_cDMA 
SSUIS cDNA 5 



CftSSGATGGTGaASA^^ 
CAGGGACAOTft&AAGGC^^ 



ratjcDNA 

hun^n_B+3_ccrrected 
mus cDtJA 5 



rat_cDNA 

human_5+3_corrected 



TGGAEGGAAAATACATACTG^ 

TTA&TGGGAA^ACATATT<X^ $ACRATGGCACXrTT^XC^TAAA<3 1 AGCAAGAGCTT 



rat_crm 

huaaan_5+3_corrected 
mas cDNA 5 



ACSACCBAiGGAAATOA!^^^ 



h\i»aan - 5+3jcorrect€d 
XOUS cDKA 5 



lmman_5+3^Corrsct^d 
mus 5 



TCMGASGACAGGGG^^ 



human^5+ 3__cor rected 



AAATC&CMGGGAGA3HKOT^ 



rat_cDRft 

human__5 + 3_com*ct®d 
muss cDNA 5 
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xat_cDNA 

&2nan_5+ 3__coxrec t £d 
mus cDHA 5 



ratjsENA 



AraTCC&t^ACTCTSAC^ 

ATATTOAOT^ATCT GAC&E GAA -A2M5AAAGT-CAACM CATCXGGGCA 



huaan_5+ 3_corrected 
toua cDNA 5 



GAATTTAmTTTG^GAAOTr^^ 



ratjzDRA 

human_5+3_corrected 
mus cDNA 5 



ATACATTEACftOTWTA^^ GACTTOTAAATGAAA 

AXACATTTACA^AMAAAimiA^^ 



ratjcONA 
cDNA 5 



GCATTGTGfiACTGM ACCGAGTCTCTC--TGGft5^ 

GGATTA3^*ftAGT(M<^TAC3^ 



r^t^cDNA 

huroaxi__54-3_correcte<l 
roue dftJA 5 



GGCACmGAimOCCAACAAAT^ 

GGC3UCTTTIATTTT(3CCAAGaA AC^TTGAAACGGTICACTAT 



ratjCDNA 

hi»*tanj5+3_correctdd 
&ae Ct>NA 5 



GAAAT^CAMCGGCTAATGCACCTGAATTCT 
AA^AAOVAAfGGCO^PGTAOT 



rat^cDHA 

human_5+3_caxrected 
mus cSsa 5 



AGTTGCCASCTGC^CGtKH^^ 

A CCMTTC^CTAGTCTCC^ A 



rat_cDNA 

husfi^_5+3_eorr£cted 
ans cDNA 5 



ATGGAGTCAACGGG^GATTA^^ 

ACAGAGACfUVFGGAAAASA^^AAATCXGC AATCT— AT'GTATAAAIATTT 



ratjcE&JA 

httffian_5+3_correctftd 
jwis c5ka 5 



rat_cDNA 

huniaa_5+3jco£r«cted 
muz cMJA 5 



T<31?CNCT^TOEATAAACA!JTTT-GAT 

TGT (3<n^TATAAATTKOTTGCTAAAACCTACA^ 



AfiAA 



(rat_cDNA: SEQ ID NO:7) 
(human_5+3_corrected: SEQ ID NO:8) 
(mus_cDNA_5: SEQ ID NO:9) 
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Figure 15 



r*t 

hu»aa_5+3_e©rrected 
aouseTs corrected 



HQfVKGREVSGLLI SLTAVCLVVTFGSfcACS^^ 
MQKRGREVSCmSLTMCLm 



rat 

hutBaa_5+3_cerrected 
xaouse 5 corrected 



rat 

mouse 5 corrected 



rat 

human_5+3_corrected 
aaouse"£ corrected 



IFKTSFIKYLFLSDEEXTSXPKE^SYKP^^ 



rat 

kuttan_5+3j3orrected 
mouse 5 corrected 



rat 

human_5+ 3_co rr ected 
mouse 5 corrected 



rat 

hurnan_5+3_corrected 
mouse 5 corrected 



rat 

huma»_$+3_corrected 
*ouse 5 corrected 



rat 

hu&an_543_correeted 
mouse 5 corrected 



rat 

huaan_5+3_correcteci 
mouse 5 corrected 



ADS ETJflGLYHClSTNDaa^^ 



huma&__5+3_co rrected 
mouse 5 corrected 



rat 

human_5> 3_co r rect ed 
mo\ase"*5 corrected 



rat fiRDiaHTt^DSTl^BTI^HBBQL?LSMW 
buaan__5+3 w Correcte<i YB^TI£RROTSTJ3KIU^^ 

mouse 5 corrected • « * — ~— 
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rat PIATOLVELTDEEKQ&SGMIPro 

hU2Saxi_5+3_corrected PWTQLPNI PGESDDSS AI1HE ^FKVFATKALNLP AKTVTADS RII S PS PHTNINYGT 

au?use~5 corrected — 



rat EVS -TVNPQTLQSEHLPDFKL FSVTNCTAVTKSMNPS IASKXEDTTNQNP 1 1 X FP SV 

human_5+3_COrrect^d EFSPWNSOXLPPEEPTDFKLSTAlKriTAMSKNIN 

mouse 5 corrected ■ >■ — — — — — — ~ 



rat ^II&SAQA(3^--SQSARP^ 

hU»an_5+3_jcorreCted TEFQDS I^^GKGREHFQ^PPITVRXMIKPVHVKMLS STTNKL — LLESVNTTNSH — QX 

mouae^Sj 00 *** 0 *^ ■ — 

rat PlTC^SRPSSSDISSKTT/toPSFSSHPSGSH^ 

humaa_5+3_corrected SVBEVSEPRHNHFYSSTTQXI^STFPSDBHT!^^ 

mou^cTs corrected 



rat IWSRGRVKNPHRrmRimSHK^ 

hmau_5+3_corrected IGGRGRI I S PYRT PVLRRHRYSI FKSTTKGS SEKS TT AFSATVLKVTCLSCLPRERLTTA 
mouse 5 corrected — , 



rat TAZOSVPSSSHSALPKTNNV^IASEST^ 

human_5+3_corrected IMLS FP^APITFPK^I^VPSE^TTLVQNPIilJ^KP- -STVEKETPIIKXFRTBI 
mouse 5 corrected — > — - 



rat NHVI pte&smtsapt? svslqks fvd^ghlsmpgtiqtgxdsvsttplpspls TP — SIP 
huntan_5+3_corrected SQVTPTGAVMTYAPTSIIWEKTBKV^ 1 
mouse 5 correoted — — 



rat TS^KFSKIOTPI^OIFVNNQKKEGM^ 

hTOan_5+3_corrected AIT^SR£KIPWQQNiYNNHNPK(3fcI>ftK^ 

mouse_5_eorrected — „„ 

rat WSSSeB^TimXQ^^ 

h\anan_5+3_co rrected TLSTSVMQI PSOTi«3*TMHTTTKXHi^~St«M 

mouse 5 corrected , _ ,. — 



rat ISTEfPT-TOSFWtfaSVJWSETQRTRSKE^ 

huaan_5+3_corrected ISTQTMPimPTFPasviryE^^^ 

saouse^Sjcorrected — — 

rat SlttTT2U>TPLAFSHSPRC^DG(^ 

huaan_5+3jcorrected TBMTP — PAIAFTHSPPESnrGX $STI SFHSRTIMIaTDTIBBLftOI^TQTI^TIMErrT 

mottse_5_corrected - — . - — , « , — 

rat I£$K$QESTTVKRAS-DTP-PFU,^ 

human_5+3_corrected LSSKSHQSTTTKKASLDxmPm 

j&ouse 5 corrected 



rat VTIYESSRBNTDLraPSAEASPttPEII^ 

&UTnan_5+3_corrected VKLHESSKBN — L$4PSSQLEP XT$6TStILLRSTPMPMiTTVKSQN$J^TPSP 

aouse^Sjcorrected — — - — — 

rat WPSHKYQLS^SETIEKG^ 

huroan_5+3_corrected WASYQF*raKPYSDIAEKGKg£EV$H^ 

mouse 5 corrected : - — «« — „, 
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rat ^SKHLPWSLPKTLI2CKP£IIGGK^ 

hUE^_5+3_corrected TTSKLLPFD SLSRYI FEKPm VGGKAASFTI PiU^SD?VH'PCEAVGNPLPTlHWTKVS--GL 

Rouse^ 5 — corrected — ~. 

rat £IS0<^KSRFHVIiPSK3TL$I8&^ 

human_5+3_corrected DI^GNQHSRVQ^m3TI£IC&^ 

a©uee_5_corrected - 

rat BHVKSITVHFGSTVEI^CRVE 

htanan_5+3_corrected KKTKEITVHSGSTVEIiKCRftE<^ 

mouse"~5 corrected — — « : — — — — 



rat I»SLYI>IUSmCV&$!re6<^^ 
huaian_5+3_corrected LSIYDKGFYKCT^l^^DSLLVKIQ^^ 

mouse 5_corrected — - — . 



human_5+3_C0rrected PQPSVYSrVLSDGTEVKFtiQFTHSKLFLI^^ 

ffiotiae 5 corrected > 



rat VILTVEE<^IPRIB!I&SQftira^ 

human_5+3_corrected VMLTMEERVT S PRI EA&SQKRTEVKFGDKLLLNCS AT GE PKPQ IMWKLPSJC^WDQ- 

mouse's corrected 



rat GSRIHVYPNGSLWGSVTOKDftGD^^ 

human_5+3_corxected Gs^HVYPNGSLFiGSvrsKDSSmcv^ 

mouse_5_corrected 

rat VIBGKDEtfWKASGSPTO^ 

xaouse 5 corrected ~ — 



rat GMaE3OTYic$AQrtr£<^53^^ 

mouse S corrected 



rat KPNVFm,PSNNVI SFSNDRF^^ 

kuman_5+3_corrected KPKI£TO*LPSNimSF$ID£^FHMGSLTI 



mouse 5 corrected 



rat IVSKPFLINGLYMmTVIKftXAIJifl^^ 
hunan_S+3_corrected VVSKPPLINGLYTNRTVIKftXAVEHSKKH^ 

aouse 5 corrected — — — — ™- .„ 



rat V?7HPNGTIJEMRKIBLSDSADFrCVVR5 

hut»anJ5+3_<»rrected ITVEKKGTI£IJ!KVRI*SDSJRJ^ 

mouse^S^corrected — — 

rat Qfi OKPV aL!K^TOGNPPPEITgIL 

human_5+3_corrected QIXSKSTALNCSVDGNPPPEIIWILFN^ 

mow 5 corrected , . 



rat &CAA?LNKVGYIEKLIIiEIGQ^ 

mouse 5 corrected ™™, 
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rat <3GmnCDRPQVp<3f<YILH^ 
human_5+3_corrected $GTVVVkPQI8GKXII^^ 

m©u3e_5_corrscted 

rat il^EaStttRKrGBZW^ 
hu3nan_5+3_corrected I!PHI#?fc$IVTI£<3AAF0l£^ 

mous o^5_corr act —————— — — — 

ra t LVI^LOTS DS GVYKCRAQNLLGTDYATT YI Qf7 
himan_5+3_COrreOted 1VIQNPQTS DSGIYKCTAICNPLGS KCAATYIQV 
mouse_5_correctecl 

(rat: SEQ ID NO: 10) 
(human_5+3_corrected: SEQ ID NO: 11) 
(mouse_5_corrected: SEQ ID NO: 12) 
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rat 

human 5+3 corrected. 



*.*.** **•*:•.•***.*** ^.*********^***************^** % *** 



rat 

human 5+3 corrected 



*********^** * a ** B ** ; **it******-**** « ^-***** # **.***********. . 



rat 

human 5+3 corrected 



QKI>TF YGLRSLTIU^HMDHNNIEFINPEVFYGLN FLKLVRI»B<3N0LTKLH^ t)T FV$LS YLQ 

.******* ** *************** **** ************************** 
• • » tt * 



rat 

human 5+3 corrected 



*** ***** ************t*******J********W*******>***^ **; 



rat 

human 5+3 corrected 



******** ** ********** *♦*.*.***: • ** «r-***** ******* ** . 



rat 

hujc&n^S+S^corrected 



^T$PQDFI£PrGSI£IKMTXXS^^ 

** ***.*; ********** ft**.*.********-***** *********.** j**** 



rat 

human 5+3 corrected 



*** ; ** *******-***********»* •****+ J ****** *** 



rat 

human 5+3 corrected 



* : *»** * ; ft*****************.************. • ***** ** 



rat 

human 5+3 corrected 



,***; ******* 5 , * ,*** • *** ***♦ *************************„******* 



rat 

human 5+3 corrected 



ADSFDTGVYHCISSNYDQiWm^ 

*****-*.*****.* ************** *♦ »f **:»#*** ******** ***• 



rat 

human 5+3 corrected 



pimiswilpgotvfsepsri^ 

******** j*** a *. i^***. ; j***************** .*.******** f ** *j* 



rat 

human 5+3 corrected 



SVQKKGQSMVEHDR^^ 
SVKMKGQRPLEKDGETEGSGLOESl$PI&^ 

** : **+* **** +• **** # * 4 -* # 4 **** ## «* ♦ *** j*^**.^*^ 



rat 

human 5+3 corrected 



KRDLtHKRRGDSTLRRFfcEHBSQI^ 
YKEt^LQIUlGDSTHRRFRENRF^ 

-*** ******* *****•**>;★ ********************* ,* a ******* a * 



rat 

human 5+3 corrected 



PWTQLPNIPGEEDDSStmALHEEfc^ 

*:.. * ::..**.*:***:. .***** *** •*,*: **** . - . . ** * ** 



tat 

human 5+3 corrected 



EVS-TVNPtfTLQSEHLPDFKLFS^^ SV 

* * ** * * * **** . « **••*•**•••*•*-***** «** 



rat 

human 5+3 corrected. 



.*..** * jr* ** *♦* * ► * ** * * •*. «* ***• * 
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rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



PITOT$RPSSSDI$$HTTADP$F$SH^^ 
SVRi^SEPRHNHFYSHTTQILSTSTFPSDEHTM 

^, ** % * **** * * ; ^**^** : * * * ****, * * : ** *::**:*. 

* IWSWRVKNPHRimRl^ 

IGGRGRIISPYRTPVLPJ^YSIFB£TTRGSS^ 

* ***- ************ * - *t . .** * :* . * **„* 



rat 

human 5+3 corrected 



TAAI^SVPSSSRS^PfCTNWG^^ 
TW^LSFPSMPITFP^IARVPSEESTTLV^ 



******** 



* ****** .**.♦****.; .* 



.* * * ** * 



rat 

human 5+3 corrected 



jraVIPTE&Star8APT5VSIi<^ 
gQVTPTGAVMXYAPTSIEMEKKiKVN?^ 



.* ** * ** ***** . « *. 



.*** *•,*„,, j j < 



rat 

human 5+3 corrected 



; *.«*.** * ;* ****** .* *.* j *** :* ** **** . * 



rat 

human 5+3 corrected 



TLLT3PPPftI£TTfcfl#ftTQNK)GTEW 
TLSTSVMQIPSNTLTO*HHTT^^ 

** ** * *-*5lM. * ' . ** • *• ** *** * * 11 



rat 

human 5+3 corrected 



STETPT-VTSPmtfA^IMSETQ^ 

SaXlTAIPATTPtPPPASVlTCETQTERSKaQT IQR£<ffiPQjaC^DDPNISFDQSSGFTTPT 
**:*♦ ,*:** *** **« . * : * :* * **::t r 



rat 

human 5+3 corrected 



ALTTSUTCFl^SHSPRaDDG^S 

Z^P— PMAF!raSPPEKT!rciSSTl$FHSRn)^ 

*♦* ******* * *£«*•** * * ** - **** * * **** 



rat 

human 5+3 corrected 



LSKSQESXTV£&AS-OTP-PFIiSS<a^^ 

SSKSHQSTTTBKASLDTPIPPFLSS SATLKPVPISPPFTQI^VTDTRGDSHrRIWTNTVV 
***..***,**** *** ******^«_ »****: .***•; * ;* ** 



rat 

hvraan 5+3 corrected 



TIYE$SIU3NTDK&PS£SMPi^ 

KLHESSRHN — LQMPSSQLEP LTSSTSNlJJISTPMPALTrVKSQNSKLTPSPW 

******** ** **.. * * ***** ** ***** **** 



rat 

human 5+3 corrected 



PEHKYQLKSY$ET1EKGKRPAVSMS PHLS ItP^^rHASHWNTQKH^KS VFDKKPGQKP- 
AEYQFTO1K?YSD:HIEKGKKP^ 

t *«»j *.**; ****.* *** # .*„**:*_.***: **::?** ***** *:* 



rat 

human 5+3 corrected 



T SKHLPYYSLPKTIiLKKPRl I GGKAAS FXVPANSDVFLPCEAVGt>PXPI IHKTRVSS GXE 
TSKLLPFDSLSimFEKPJUtt^^ 

*** ** ? ** f * : ; ************** j*****^ ************ ******* * ; 



rat 

hman s+3 corrected 



I SQGTQKSRFWLPNGaJLSIQRVS I QDR<30YLCSAR^PIlGVDSFHVSLSWFYPARI LDR 
LSRGNC^S^VOVLPNGTLS IQRVEIQD*<3$YI£SASH^^ 

-*-* *.** ************* *********** * -* *****;**** ******* 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



;- ****** *********.** * **************** * J***: . ** ****;;;** 

SLYDRGFYKCVZ^KPSGQDSLLVKI QVTTAPP^XEO^OAIVGVLGGS LKLPCTAKGT P 
SITDRGFYKCV7\StfP<SGQDSLLVKI^ 

♦j*************^************.********.**^***^ * ************ 



rat 

human 5+3 corrected 



QPSVYSVLSDGXE^PXQmSKri^ 

♦**+;*** *********** # ******* b • * r * ******************* 



rat 

human 5+3 corrected 



i:LTVE£G£TIPRIE^A£QKW^^ 

IVnVTWmWTSP^aEAASQ^T^^ G 

* ********* A********************-*.********;** * 
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rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



SWIHVYPHGSLFIGSVTEKDSGVYLC^^ 

* *********, : -******+j* ************* ***** ***^***+- .****.*** 

**************************** J i **, ******;**;*****♦*******;** 
VAEEGDYTC^QNTLGKDEWX^ 

******* * ******************* ****** **. *..**************** 

* • . ******** ;**** **(***** •*!*.*** A*********.******** *«**; 

VSKPPLINGL YltifcTVI KATAVRHSKKHFDCRAEGTFSPEVMW IKPUNI EXTAPYYGSRI 

*********** j*. *******.*****;***** ;* ****,****,***;***; 

TVHKNGTTiEireNV&LSD^ 

*** *****.*********** ** a *_*****************************; ;** 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



** _ ^************** ****♦**;♦;* **♦*,***.*« *;**:,:***"* 

C2y*F2^GYIEKLVII£lGQKP^ 

*************- ; ************ ** ( ; *** . ************** ; Hi * # 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



GHVIDRPQVTCKtflLHEHGTL^ 

*;*.****; .******«******* .**}*******;******• J ** J*.** • ******* 

INYLE&NMLMCCGE^QIiHCVaL^ 

* **,;: *** *♦********«***•;*** * *****^*; ** ** **** 



rat 

human 5+3 corrected 



VTQN?QTSDSGIYKCTAKNPI^DYM,TyiQVI 
*+** ******.*** *«* **.***.*****■ 

(rat: SEQ ID NO: 13) 
(human_5+3_corrected: SEQ ID NO: 14) 
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Figure 17 

MQKRGREVSCLLISLTAICLVVTPGSRVCPRRCACYVPTEVHCTFRDLTSIPDGPANVER 
VNLGYNSLTRLTENDFSGLS 

KDTLYGLRSLTRLHLDHNNIEFINPEAFYGLTLLRLVHLEGNRLTKLHPDTFVSLSYLQIF 
KTSFIKXLYLYDNFTSLPKEMVSSMPNLESLYLHGNPWTCDCHLKWLSEWMQGNP 
(SEQ ID NO: 15) 

Figure 18 

MKVKGRGITCLLVSFAVICLVATPGGKACPRRCACYMPTEVHCTFRYLTSIPDSIPPNVE 
RINLGYNSLVPJJvIETDFSGLTKI.ELLMIJiSNGIHTIPDKTFSDLQALQVLKMSYNKVRK 
LQKDTFYGLRSLTRLHMDHNNIEFINPEVFYGLNFLRLVHLEGNQLTELHPDTFVSLSYL 
QIFKISFIKFLYLSDNFLTSLPQEMSYW^ 

KCKKDRSPSSAQQCPLCMNPRTSKGKPLAMVSAAAFQCAKPTIDSSLKSKSLTILEDSSS 
AHSPQGFMAPFGSLTLNMTDQSGNEANMVCSIQ^ 

VCNIDYGfflQPVWQILALYSDSPLILERSHLLSETPQLYYKYKQVAPKPEDIFTNIEADLR 

ADPSWmQDQISLQLNRTATTFSTLQIQYSSDAQIII.PRAEMRPVKIIKWTMISRDNNTK 

LEHTVLVGGTVGLNCPGQGDPTPHVDWLLADGSKVRAPYVSEDGRILIDKSGKLELQM 

ADSFDTGVYHCISSNYDDADILTYRTrVVEPLVEAYQENGIHHTVFIGETLDLPCHSTGIP 

DASISWVIPGNNVLYQSSRDKKVLNNGTLRILQVTPKDQGYYRCVAANPSGVDFLIFQV 

SVKMKGQRPLEHDGETEGSGLDESNPIAHLKEPPGAQLRTSALMEAEVGKHTSSTSKRH 

NYP^LTLQRRGDSTmPJ^NRPJiFPPSARRIDPQHWAALLEKAKKNAMPDKRENTTV 

SPPPVWQLPNIPGF^DDSSGMLALHEEFMVPATKAU^LPARTVTADSRTISDSPMTNIN 

YGTEFSPWNSQILPPEEFrDFKI^TAILTTAMSKMNPTMSSQIQGTTNQHSSTVF^ 

ATEFQDSDQMGRGREHFQSRPPITVRTMIKDVNVKMLSSTTNKIXLESVNSHQTSVPJEV 

SEPRHNHFYSHTTQILSTSTFPSDPHTAAHSQFPIPRNSTVNIPLFRRFGRQRKIGGRGRnS 

PYRTPVlJlRmYSIFRSTTRGSSEKSTTAFSATVO^TCI^CLPRELTTATAALSFPSAAPI 

TFPKADIARVPSEESTTLVQNPLLLLENKPSVEKTTPTIKYFRTEISQVTPTGAVMTYAPT 

SffMEKTHKVNASYPRVSSTNEAKPJDSVITSSLSGAITKPPMTnAITRFSRRKIPWQQNFV 

NNHNPKGROiNQHKVSLQKSTAVMLPKTSPALPQRQSSPFHFRRLSTSVMQIPSNTLTT 

AHHTTTKTHNPGSIJTKKELPFPPLNPMLPSIISKDSSTKSIISTQTAEPATTPTFPASVITYE 

TQTERSRAQTIQREQEPQKKNRTDPMSPDQSSGFTTPTAMTPPALAFTHSPPENTTGISST 

ISFHSRTLNLTDVIEELAQASTQTLKSTIASETTLSSKSHQSTTTRKASLDTPIPPFLSSSAT 

LMPWISPPFTQRAVTDTRGDSHFRLMTNTVVKLHESSRHNLQMPSSQLEPLTSSTSNLL 

HSTPMPALTTVKSQNSKLTPSPWAEQFWHKPYSDIAEKGKKPEVSMLATTGLSEATTLV 

SDWDGQKNTKKSDFDKKPVQEATTSKLLPFDSLSRYIFEKPRIVGGKAASFTIPANSDAF 

LPCEAVGNPLPTIHWTRVSGLDLSRGNQNSRVQVLPNGTLSIQRVEIQDRFQYLCSASNL 

FGTDHQLHVTLSVVSYPPRILERRTKEITVHSGSTVELKCRAEGRPSPTYTWILANQTVVSE 

SSQGSRQAVVTVDGTLVLHNLSIYDRGFYKCVASNPGGQDSLLVKIQVIAAPPVILEQRR 

QVIVGTWGESLKLPCTAKGTPQPSVYWVLSDGTEVKPLQFTNSKLFLFSNGTLYIRNLAS 
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SDRFTYECIATSSTGSERRVVMLTMEERVTSPRIEAASQKRTEVNFGDKLLLNCSATGEP 

KPQIMRLPSKAVVDQGSWfflYPNGSLFIGVTEKDSGVYLCVARNKMGDDLILMHVSLR 

LKPAKIDHKQYmKQVLHGKDFQVDCKASGSPVPEISWSLPDGTMINNAMQADD 

TRRYRIJWGTLYFNKVGVAEEGDYTCYAQNTLGKDEMKVHLTVITAAPmQSNKTN 

KRIKAGDTAAVLDCEVTGDPKPKIFWIiPSNDmSFSroRYTFHANGSLTINKVKLLDSG 

EYVCVARNPSGDDTKMYKIJDVVSKPPLINGLYTNRTVIKATAVRHSKKHFDCRAEGTP 

SPEV]VrMMPDNIFLTAPYYGSRlTVHKNGTLEIRNVRI^ADnCVARN^ 

VI^MLRRPTFRNPRNPFNEKIVAQLGKSTALNCVSVDGNPPPEEWILPNGTRFSNGPQSY 

QYLIASNGSraSKTTREDAGKYRCAARNKVGYIEKLVILEIGQKPVILTYAPGTVKGISGE 

SLSLHCVSDGIPKPNIKWTMPSGYVVDRPQINGKYILHDNGTLVIKEATAYDRGNYICKA 

QNSVGHTLITVPVMIVAYPPRrrNRPPRSIVTRTGAAFQLHCVALGVPKPEITWEMPDHS 

LLSTASJCERTHGSEQLHLQGTLVIQNPQTSDSGIYKCTAKNPLGSDYAATYIQVI (SEQ ID NO: 16) 



Figure 19 




Signal Peptide 
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Figure 20 
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Figure 24 
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Figure 26 
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Figure 27 
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Figure 28 
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Figure 30 
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Figure 31 
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Figure 32 
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Figure 33 
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Figure 34 
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Figure 36 
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Figure 38 





Figure 39 
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Figure 40 
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Figure 41 
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Figure 43 
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Figure 45 
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Figure 47 



(XbaI) 

TCTAGAAGTAAAATQATCCTOAGTAGC 

AATCATCrTCTCTGTGGTTT^ 

ACCTACCXjAATACAGAGCTAT^^ 

gtagchxxxx^agaacatactagatgccatatt^^ 

cttaagaatatggttgtctctgc^^ 

(kjttagtgctaactctoactgcttgactctttaattctac^ 

atcacttggtgattagcaactggatatogaatattactaatttgtaccc^ 

ctcagctttggcagcto^^ 

actgtttctacttacaagacgcattacctgagatgagtcattiti^^ 

gaagagtcaattagacatattctgatgaagtaagcatataaa 

tgttccatgtatgctcatckjatgctattataatgtggaaata^ 

agcttatgatacrrrgtcacagagtaaatcttccataaatatcatc^ 

tttcataatccatcaattaaaaacctttagaaattttgt^ 

CTGCCCTAGGATGGTCTGTATGGTGGGCCTGAGAGaT^ 

aggagcacatcttcagaacatctgcctc^aaacam 

actcacatgtgcitcaaccacagggattaaacagtcat^ 

ggaagcctgctagaggaacaggatgtatcaggataacatccaaccttacaaaaggatgtc 

ataaccctcaccacaacaaacaacaacgacaacaaac^ 

gaactaagccatatgcagaaaaagtattatatgttctcattg 

agtcaaatatgcagaatatagacaaagatggtttatgcaagxggggatggcgaaggata 

cttgtagattagaggacacaaagcaacaactacagagtgaagtaatccagagacttaatg 

tataatatgaggactgtatttaataa^ 

actaacacacacacttac^tagagagaa^ 
tagctgataatitcatattgtacacctcaaacatagatmccaacaaagagg 

aggtgcctctcccagggcggaagagtacattcgaaagtcagacac^ 

ccacatggaggagctagagaaagtagcca^^ 

tqgaacaacattatgagctaaccagtac^^ 

aaaagatggcctaatcgg<xamctggaaagagaggcccato 

atxkx;ccagtacackk3gaataccagggcc^aaaaggg 

ggggtgggtggatatgggggacttttggtatagcattggaaatgtaaatc 

ctaataaaaaatggaaaaaaaaaaaaaaaaaaaaaaaaaggaagg 

ttcactgctatctcaacttgc^ 

gattgaagcgtctttgmtgttattgctgto 

tgtgaaaacttacggggtctatgac^tgttttm 

atatttggcaccactaccagccccagcccccmg^ 

cacacacacacacacacacacacacacacacacacacacacaorm 

c^tcatctgctc^c^ 

ctatttn'atgtctackjttccccctccccctc 

agaatttgtagcccctgago:agcccggg<xacagagcctgc^ 

cagggcttaccagcacaggaggagcaaactc^ 

cccgggtcgatggcccgcccattccx:agaagccgaggcta 

CODTCCTaXXjAGTGAAGA 

ttggcccaaggcagaggtgagcggctgctaatcccctc^ 

agtctggtagatgtgggagctogggttcagggcgagacagagw 

ggtccagga aaaggaaagtactggaggggagttggo aca AAAGCAGCQAGCAAQG GAA 

gatcqcttcagtoactgaagccaggc^a^ 

cgctttcataaacactgatoacgtgtitc^ 

gggacgtcggaaagaaggagtgatcgattagtacttqtaagtttaggtgagt^^ 
taactaacctatactattgagogaga^ 

GCAATCAGATAAAGGAAAGC TlTfGGTT^ 

gMGCGCCTCCACmCTCTGM GAGTCAG 

QCCTTTTCOTCC<^^ 
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AGATTTCCTOTCrCT^^ 
TATTGCCAACTCTTTTGTTAA^ 
TTCATTTACG<5CTCTCI^^ 
CAGCTAC(HTGTA<XATCTTTGTITATCT 

ACGCAGAGAGTATTCAAAAATATTCAOTGATGTAACAGTGACAGTC 

ATCATTAGTAAATCTTAATTTCKKyTTAAACTC 

CACnXjAGCCITCGCCACQTQCGGGTrAAAGATATTITCT 

CTIGGCCATGCTCCCCATCACTGTGTCAGTAAGCAG^ 

AGCAGACAGTGTTATG<Xn?GCAAAGTCAGAGACrCAG 

GTCCTCCCGGTCATTAGTTGGCTCTGAAAAGGCCCATGTGTCOTA 

AGAC^TGCTAGAAAGAAATTTGACCITTTTTTCT 

TTrGGAAGGTTAAG<XAAArAAATAAAACACATATTAAATAATACAATO 

GATCATATAAAGAAGTACATTCATAAATGCAATGTG^ 

TTACTGGTGCAAAGTTTTCTAAATTGCACATGTACTATTTT^ 

AAAATGTATATAAAAGTGTAAAAGGCTCTTGGTCAAACAAGAGAGTTAA^ 



TTAATTGTCCCGATAACATTATT^^ 
AATGAGAAGCTATGAAGATATGirTACAATAATAAGCCCATTIAGTGATAAA 
GGAAGCTAGCACACACTGGTTTATAAAGAGAACAGTTnXTO 
CTCTAGGGAATAAGAGTTCCTCTTTCTCCAG 
TTCTTGATGATGAGCATTAT^ 
ATGCCCTGAACAATCTITCAGGTCTT^ 
TTGGGGGATGGTGTTAATTTTTTTGCAGTTCTTAT^ 
AAACAAACAAAAAACCTCTGAAACTAGAACTACCAATCCATTACTGGGT^ 
AGAAATCTGCACAGAATTTATIXX^TACATTGTTC^ 
ACCAACTTACGTAGCCGTCAAAATATOAACGGATAAAGA 
AGAGTCCCATGTGG<XATAAAAGAGTGAAATCATG 
GGAAATCAATTGGGCTAATCAAAACAAGACAGACTCAAAAA 
CTCTGACAAAC^GAAGCTAGATTTACACTTGT^ 
TAGTTATACACTATTCTAATCTGTGAGTGTGTATAAA 

AGCTAGCTGGGGTGGGTAGGAGAGAAACK^AATGAGAGGAGTTAATAAQLAACGAAGCATA 

GTMCATAGGTG<XAGGATGAAATG<^TTAATTTGTATGCTAACT 

OKK^ACACGTTCAAACCAGGGTGAAATCCCAGCACAGAGA^ 

TTCGCCACTAACGAAGAAGCCATTTGCAGTTGCTGCC^ 

CTCCAGTCTQ ACACTG TGTATAACAAC^ 

ITITraTGCTATTTTTCAT^ 

oCTGTGGTTTTCATTIGTTTCriTro 

\GCTGGAAACGATCTGGAAGAAGTTGGGGAAAGAGAAAAATTGTA 
iCAAACAAACAAACAAACAAAAGGTTCATTTTGCCACAAAAAGGTO 
rAGTTACGACTCTTAAAGAAAATATTCCCAATTATTCCCAGAGTTGCT 
^AGGACTTTGCTTGAACIXK3CCCTATAACTCnX^ - Artificial 




NotI site 



(SEQ ID NO: 17) 



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Figure 48 



CACAGACCTTCCTCTTC^^ 

CAGCAAGCACACTCTGCTAATG^ 

OCAGACCCACACGTTTCCAACTGCTCCTGAGAOTC^ 

GAAACCTGCTGTQATCTCCATTTCTTCTCT 

TTCCTCAAACCTTTCCO^ 

TCATTGGGAACCX^GTGCTCCATCCAATGGATGGC^ 

CTTGGTCAGTGCAGIX^TITGTATCCTC 

AACTGTTITITGTTATGAITGCTGCAA^ 

TCCTTTGTX^TCCTCCTTTCAGTACCACCCTGCCm 

ACCAAAGTGATATAGATTCATCK5CATTCCTCTAACTGCTAC^ 

ACTTGGCAGGTGTAAAAGCCrGGAAGCAGTCACGAGGCAGTTTCACAGAAA 

CTGGAACCTIXXK&TTCCCA^ 

GGTCTTGCATGCTrTCTTGCAGTATTTTACTGGATAAGAG 

TAGCAAAAGTATACCTAGAATCTTCATOACAGTCAGGT^^ 

GAAGAAAGTTGGTAAATTCnTCTGACAAATGGAGATXXXC 

AGCTAAGTCACACTCATATGCAAGAATTTACCAAGGCC^ 

GCTITATTCATGATAAGGTCTGCTAGAGCAG 

AAAGGTGTAGCATAAATTGTGACTAGAGTGTGAAATCTTTA(XnXJ 

GGCAGAGCTGTTTTATCTTTACTGTAAACATTACCTGGT^^ 

TCCTCTGTTTTGTGACAGATACTrCTATGT^ 

TTTCTGTATTATATAAGCCTGGTGT^ 

CCTTGTGTCIXjTGTCCTTTTGTCATTTC 

GTCTTTTCCACAGATTGAGGGAGGCC^ 

GCTC&CTGTTGGTGACTACC^ 

AAGTGTGGGTCCAGGAAGCAGGTTGTACACX:CTGTCTGA^ 
AAATAAGTTAACUXATCTCTTCXJTCGTTTGCTAAT^ 
TGCTGCTGGTGTGTCTCCATC^GAGGGCAGAGCCAGT^ 
TGTCCATTGTTTCTTC^GTTCCTC 

AATCTGGGAATGACTTTTCTAAGAAATGCCACGTGAACCTT^ 

TAAACAAGATATATGGTGCG<^GTTTATAATCATAATAAGCTTTGAAATAATATATAACTC 

CATTCTCATTCTXKTITCCACGCTGAG^ 

AAAGTAGTGATTTATGACCTGCTTTGTTCT 

ATGTTGAAACATGGAACTITGGGGTGACCTGTATCTGTO 

TTTOTCTCCAGAATAAATGAGTTTATCAACTTCGAGGAA 

CGCCCGTGGAGTCCCACCATTCTACTTCCTGTAAT^ 

ATGTGATTCTTCCAACTCCAAATAT^ 

ACTAAGTCTATTGATlTGGAAGATCTCAGTGGTTACKlAm 

TCAGGCTAAGCTOAAAAATATGACAAATGAAATGTCAAATGT^ 

AGTGTTAG GGGGT ITrAAAGAAACAAATACCTACKT 

CTATTCTAGTTTTGAATAATGTTCX^ 

TTATCITGACClAATCIt^GTGACTTCATCC^ 

AGGTCTCATTCAACCACATGTTTATCATTTGGGATCTGC^ 

ATTCTTTCAGAGCACTGGAATO 

TGTGCTAAACAGAGAGTTACAAGACCTTTTTATGTGGATXjGCAGCA 

TGACAGAATGTCAGAGTGGAGCITCCACTGGGGGAGGGGC?^ 

TTGGAACATCACACAjUGAAATTCCAAATTATGAAATGCACATO^ 

ACTTTIXjACTCrrGA^ 

GTGAAGGAAAGCTCG<XAACTCCTn^ 

GACAGTATGCCAGTCCCAAGATCnXJTCAATATGACTGAAGACACATTGOT 

AAGGCAGGAGMGGCAAGCTACAGTGAAGCCCAGITCACTATAAAGCmA 

AGATAAGAAGGGTTTCTAATTTTTAAAll"i 11 1 1 lATTAGATATTTTCTTCATTTACATTTC 

AAATGCTATCCCAAAAGTCCCXTATACTCCCCCCTC 

CCATTTCTTGGCCCTGGTTACTrGTGA 

ACTATCTGGTCTATTGCAAGAATGGCT 

AGCCAAACAGTGGAAGGAACTTGGGGA(nxnT 
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(XX:CCGGATG<X3GAAAGGAACTCC^^ 

ACX:CTTO<KK]Knra^QAGTCTGAACC^C^ 

GCCTCTCCACTCATATGTMCAGATATGT^^ 

GATGGGGGTTAGGGGTGG<K}ATGGGGGmTCTCAAAA<^^ 
TGTTCTTACTC^GCTGOTAGT^ 

GTAAAGACTTGAAGTTCT(H<KSTfiltl(KJTGQAQOOTAT^ 

TAGGAAGAGAAGAGGAGGGGGAAGACTTGGGGGAGGGGGCAGTGAGCAGGrrGOT 

ATOAATAAGAAAAAAAAAATAAAAATAATTAAAAAAAAAAAAGAATCKKTrXKTO 

TACTCTAAAACC^TOK^TCCCCOCCCCO^ 

GATACTAACAATAGATOTAG<3AAACTATTXnTAACITCOT 

GAACCrrTTTTATTGAA(KJrTTCACACAGAGCCrre^ 

TAGCrTAATCATCnTCAGTGAATTAAAAATAACAATGAAGA^ 

AAOOATrcrrCGTATATATTTAAAAATCTAAGGTG<JTCACCTGGA^ 

TTTCTATATATGTTTACTCTATCOTATGTTAATO 

CAAC^TAGTATGTAACTTATAAGAAAGTAAAAC^TT^ 

CrrAaX^CAGAAACACTGGGTTTGAATAATTCTTATTTTO^ 

AGAATCTAATAACCTACTAAGGCAAACATAGAAGTTGAAATTAAAAAGAGTAGACAGGA 

GAAGTAATAAGGCAAATAATGAATATTTGCITTAAATAGTTCITAATGTA 

GGGTGTGATTCTtXAQACTTGACIXXATCCAAAATATCCAAAATGACT 

TTGAAACAATGTGTTGAAAATAATAAACATTTC^ 

TTGCTTTTTATTGCTGTGATAAGGACCATGACCAAAGCAGCTTAT 

(SEQ ID NO: 18) 
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Figure 49 
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Figure 50 

TAAGCCTTTTGCTCCCGTTGGAAGCAAAGAACGTTCCTTCAATCAGGTGAAGGCTCT 

CCTCAGAAGATTTCATGTCTCAGCTTATGTTACAAGAGGATTCAAAAGCAAGACAGA 

AGAGCTCAGGTATAGCCAACTCTTTTGTTAAATACAGTATGAGGCTTAAGTGTACGG 

CAACTCATGTGGTATTCATTTGCGGCTCTCTTCTCTTATAACTAACTCTTAAGGTGCA 

TATAGTCTCTTCTGTTTCCCAGCTACCTTGCACCATCTTTGTTTATCTAATAATAGCA 

AGCTCATCTGCTTTTTAATCATCACGCAGAGAGTATTCAAAAATATTCAGTGATGTA 

ACAGTGACAGTGTAGGCATAGAAGTAATCATTAGTAAATCTTAATATGGGTTAAACT 

CATTCATAACAGCTCCAGGTTGG (SEQ ID NO: 19) 



TTTGGAACCAACCCAGATGCCCCTCAACAGAGAAATGGGCCAGAAAATGTGGTCCA 

TTTATCCAATGGAATACTACTCAACTTATTAAAAACAACGACTTTCATAAAATTTTTA 

GGCAAATGNATGGTCTGNAGGATCTTGAGTGAGGTAACCCAATCACAAAAGAACAC 

TCATGGTATGCACTCACTGATAAGTGGCTATTTGTCTATGGAGTGATTTAAAAGGGA 

AGAAGACACATAGCTTTTTGTGTGTATAATATTAAGATGGAAATTTGCCAGTGCTGT 

TTGGCTTATGAGTGAATCTTGTTTCAGTGGATTACCGGAAGAAAATAATAAGTGAAC 

TGTAGGAAGAAGTAGTTAATCAAGGTGACAAAGTATCCTGACACATTGGGAAAAGA 

CCACAGTCCAGGAAACTGAGTCTTAAGGATTCATATTAACTCCAGTTCCCCATGTGC 

AGCTCTGAGACTTTGGCAGATCAGACACTTAACTTCACCAGCTTCCTACACAGAGCA 

GTTACTATCCTTGCCTTCACACATGGAGTGTGCCATTAAGTGCCTGAACATGAGTCT 

GACTTGTTAATAATCTTTAAAATCCAATTGTGTGTAAAGTATGTGACCAAAGAGCAT 

GGTCATGCTATTAACCTTTGATGTTCTATGGACTCTTAATTTTATGGTAGAAATGTCA 

ACAAGCTTGTGGAGGCTGGAAGATACAAGGCTTAAGAGGATGGCCTTTCAGTTTTG 

AAAGTAATTCAGTATGTGTTCTGGCATCCCTTTTCCTAAAGCAATTTAACCCCCCAA 

GTAGGCATAATTTTAATGCTTACTTCATCAGAATATATCTAATTGACTCTTCTAAAAA 

GACTTTGGTATGCATAGGATCTAAATGTAAATGTGATTTACTGACATAATAAATAGG 

AGAAACTGAGCTAGAATAGGTATAAAATATGTGCTGGCTTTCTAATAGGTCTTATAG 

GTTATATAAGAGGTGGGAAAGGAATATTTGAAACATCTAGAAGTAAAATGATCCTG 

AGTAGCGATCCTGGGAAAATACGTACTCTAACACACTGCAATCATCTCTCTGTGGTT 

TGCTGGAGCTGAGGTCTGGAAGGCTCGACCTTGGTTAGAAATAACCTACCGAATAC 

AGAGCTATGACGTTAGTCTGGAAGGAGCTTTGGAAGAATGACAAGCTGTAGCTGCC 

CAGAACATACTAGATGCCATATTTCCAAGGCAAGTGTCCACATGCGGACATCTTAAG 

AATATGGTTGTCTCTGCAGTGCTAAGGACCTTGTTCGTGCCACACAGGTCTCCAGGG 

TTAGTGCTAACTCTGACTGCTTGACTCTTTAATTCTCCCTTGATCATTAATGACTAGA 

AATCACTTGGTGATTAGCAACTGGATATGGAATATTACTTAATTTGTACCCAAGCCA 

GGCCACCTCAGCTTTGGCAGCTCCATTCATTCTGTGGAGCCCAGTCACGTGGGTTTG 

AATCAACTGTACTGTTTCTACTTACAAGACGCATTACCTGAGATGAGTCATTTTTCTT 

CACAAGTCTTTTTAGAAGAGTCAATTAGACATATTCTGATGAAGTAAGCATATAAAG 
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TGAGAGCAGCATGAATGTGTTCCATGTATGCTCATGGATGCTATTATAATGTGGAAA 

TAAACTGACTTTAAAAAAAAAAGCTTATGATACTTGTCACAGAGTAAATCTTCCATA 

AATATCATCTGCATTTATAAATTATTTTCATAATCCATCAATTAAAAACCnTTAGAAA 

TTTTGTTAACACAAAGATCCCTAGGCCCCTGCCCTAGGATGGTCTGTATGGTGGGCC 

TGAGAGATGGAGCTTAAGAACTTACTTGCTCCAGGAGCACATCTTCAGAACATCTGC 

CTCAAAACATTTATCCCAAATGCTCATCAAAGGCTCACTCACATGTGCTTCAACCAC 

AGGGATTAAACAGTCATTTTAGTCACATTTCTCAAACGGTGGAAGCCTGCTAGAGGA 

ACAGGATGTATCAGGATAACATCCAACCTTACAAAAGGATGTCATAACCCTCACCA 

CAACAAACAACAACGACAACAAACCCATAAAAATTATCACGGCAAATGAACTAAGC 

CATATGCAGAAAAAGTATTATATGTTCTCATTGTCKjGGTGTTTTTCCTTAATAGTCAA 

ATATGCAGAATATAGACAAAGATGGTTTATGCAAGTGGGGATGGCGAAGGATACTT 

GTAGATTAGAGGACACAAAGCAACAACTACAGAGTGAAGTAATCCAGAGACTTAAT 

GTATAATATGAGGACTGTATTTAATAATTCTATTTAAGATACACAGCAAACGAGTGT 

ATCTTACTAACACACACACTTACATAGAGAGAATAAAGTGATAGATACGTTTGTTTT 

ATCTTCATGTAGCTGATAATTTCATATTGTACACCTCAAACATAGATAACCAACAAA 

GAGGAAGAGGATAGGTGCCTCTCCCAGGGCGGAAGAGTACATTCGAAAGTCAGACA 

CCATTGTGTAGATGTACCACATGGAGGAGCTAGAGAAAGTAGCCAAGGAGCTAAAG 

GGATCTGCAACCCTATAGGTGGAACAACATTATGAGCTAACCAGTACCCCGGAGCT 

CTTGACTCTAGCTGCATATATATCAAAAGATGGCCTAATCGGCCATCACTGGAAAGA 

GAGGCCATTGGACTTGCAAACTTTATATGCCCCAGTACAGGGGAATACCAGGGCCA 

AAAAGGGGGAGTGGGTGGGCAGGGGAGTGGGGGTGGGTGGATATGGGGGACTTTT 

GGTATAGCATTGGAAATGTAAATGAGTTAAATACCTAATAAAAAATGGAAAAAAAA 

AAAAAAAAAAAAAAAAAGGAAGGTCAGACACCTCACTTCACTGCTATCTCAACTTG 

CAAACAGAAGGGGAGTCACAAACCCAGGACAAACCACAGTGATTGAAGCGTCTTTG 

AATGTTATTGCTGTTGTTGTTACCACCATCATTAGCATATATTCATTGTGAAAACTTA 

ACCACTACCAGCCCCAGCCCCCTTTGCCCCGCCCCCAACACACACACACACACACAC 

ACACACACACACACACACACACACACACACACACACACCTTTACCTTCTCCTGGGCA 

TCATCTGCTCACTCACCCACCCAAGCTTAATCCTTTTCCTTCCCTGCAATAGTACCTC 

TCCTATTTTTATGTCTAGGTTCCCCCTCCCCCTGTTAGGAGATGGGAGAGGTCACGA 

AAGAAAGGAATTTGTAGCCCTTGAGCCAGCCCGGGCCACAGAGCCTGCCACCAGAC 

AGGAAAAGCCCAGGGCTTACCAGCACAGGAGGAGCAAACTCGCAGGCGAGCCTGG 

GTTGGCGCTGGTGGTCCCGGGTCGATGGCCCGCCCATTCCCAGAAGCCGAGGCTATA 

GCTGCGTCACCTGCCCCGCCCTCCTCCCGAGTGAAGACCCCTAGAGGCTGAGCAGAC 

CCCAAAGGCGGTGCAATTCCATTGGCCCAAGGCAGAGGTGAGCGGCTGCTAATCCC 

CTCGGGAAGTGAAGGGACCCAGAGAGTCTGGTAGATGTGGGAGCTGGGGTTCAGGG 

CGAGACAGAGGGTGGGATGGGCAGAAGGGTCCAGGAAAGGAAAGTACTGGAGGGG 

AGTTGGGACAAAAGCAGCGACCAAGGGAACATCGCTTCAGTGACTGAAGCCAGGCA 

AAAGGAGCGGGAAGGATTATATGTAGCCTGGGACGCTTTCATAAACACTGATGACG 

TGTTTGTGCAAAGCAAGCAATTTGAGGAGAAACGCCTGGGACGTCGGAAAGAAGGA 
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GTGATCGATTAGTACTTGTAAGTTTAGGTGAGTTTGAGAACTAACTAACCTATACTA 

TTGAGGGAGAAGGAAGAGCATTCCAGCAGCAGCAGCAGCAGCAGCAATCAGATAA 

AGGAAAGCTTTGGTTAGTTTGGAAATGTATGATACCATTAAAATAACAGAAGCGCCT 

CCAGTTCTCTGAAGAGTCAGTCCCCCAGCTAGTGAAGACTAAGCCTACTAAGCCTTT 

TGCTCCCGTTGGAAGCAAAGAACGTTCCTTCAATCAGGTGAAGGCTCTCCTCAGAAG 

ATTTCCTGTCTCTGCTTATGTTACAAGAGGATTCAAAAGCAAGACAGAAGAGCTCAG 

GTATTGCCAACT(mTrGTTAAATACAGTTTGAGGCTTAAGTGTACGGGAACTCATG 

TGGTATTCATTTACGKjCTCTCTTCTCTTATAACTAACTCTTAAGGTGCATATAGTCTC 

TTCTGTTTCCCAGCTACCTTGTACCATCTTTGTTTATCTAATAATAGCAAGCTCATCT 

GCTTTTTAATCATCACGCAGAGAGTATTCAAAAATATTCAGTGATGTAACAGTGACA 

GTGTAGGCATAGAAGTAATCATTAGTAAATCTTAATTTGGGTTAAACTCATTCATAA 

CAGCTCCAGGTTGGGAGGGATCACTGAGCCTTCGCCACGTGCGGGTTAAAGATATTT 

TCTAACAAGAGAAGCAGAATTCTTCCTTGGCCATGCTCCCCATCACTGTGTCAGTAA 

GCAGAGGGGTGTTTCCAAGCAGAGAAAGAGCAGACAGTGTTATGCCTGCAAAGTCA 

GAGACTCAGCCCTCCCAGCTGGTCAGTTTACTGTCCTCCCGGTCATTAGTTGGCTCTG 

AAAAGGCCCATGTGTCCTTATTGGCAAGGACTTGCAGACATGCTAGAAAGAAATTT 

AATAAATAAAACACATATTAAATAATACAATGTTACAAAAATTGATCATATAAAGA 

AGTACATTCATAAATGCAATGTGAAAAATATATATAATTTTTATCTATTTACTGGTGC 

AAAGTTTTCTAAATTGCACATGTACTATTTTTATATTTATAAAAATATTT^ 

TATATAAAAGTGTAAAAGGCTCTTGGTCAAACAAGAGAGTTAAATTTACAAACTTTA 

ATTGTCCCGATAACATTATTATGATCTCTAATGACAGGGATCCTGCTTTTCATTGGGA 

AATGAGAAGCTATGAAGATATGTTTACAATAATAAGCCCATTTAGTGATAAAGTCCA 

ATGGGAAGCTAGCACACACTGGTTTATAAAGAGAACAGTTTCCTGAGTCTATGCAA 

GTTTACACTCTAGGGAATAAGAGTTCCTCTTTCTCCAGATTTCACTAGCATTTGTTGT 

CATCATTTATCTTCTTGATGATGAGCATTATAAGTGGAATAAGATAGGATCTCAAAG 

GAATGTCAATTTGGATGCCCTGAACAATCTTTCAGGTCTTTCTTTCAGTTCACTAGTC 

TATTCATTTATTGGATAATTGGGGGGATGGTGGTAATTTTTTTGCAGTTCT^ 

TTCCAAAAAACAAAAAACAAACCAACCAACCAAAAACCTCTGAAACTAGAACTACC 

AATCCATTACTGGGTATGTAACAAAGAGAAATCTGCACAGAATTTATTGCTACATTG 

TTCATTATTCACGACAGCCAAGAATGTGGAACCAACTTACGTAGCCGTCAAAATATG 

AACGGATAAAGAAAATGTGGAAATGTGTACAACAGAGTCCCATGTGGCCATAAAAG 

AGTGAAATCATGACATATGCAGGAAATGGATGCAACTGGAAATCAATTGGGCTAAT 

CAAAACAAGACAGACTCAAAAAGGAAACACCGTGTAGCTTCTCTGACAAACAGAAG 

CTAGATTTACACTTGTACGTGCGCATGTGTGTTTAGAATTTTATTTAGTTATACACTA 

TTCTAATCTGTGAGTGTGTATAAAGGCATGCATGTAAAGCAAAAACAAGCTAGCTG 

GGGTGGGTAGGAGAGAAAGCAATGAGAGGAGTTAATAAGAACGAAGCATAGTAAC 

ATAGGTGCCAGGATGAAATGCATTAATTTGTATGCTAACTAAACCACAGACAGGAG 

GCACACGTTCAAACCAGGGTGAAATCCCAGCACAGAGAAGGGGAAGTAGACACAA 

AGTTTCGCCACTAACCAAGAAGCCATTTGCAGTTGCTGCCTGCTGGGAAGGGGCGTT 
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CCAGTTTTCTCCAGTCTGACACTGTGTATAACAACCAGTTGACAATACAAAGTTGGC 
ATGATGGATGGTTTTTGTGCTATTTTTC^ 

GGTTGTTGTGGTGGTGGCTGTGGTTTTCATTTGTTTCTTTTGAGAGAGAGAAGGAAC 

ATGAAATTGGGTGGGTAGGAAGCTGGAAACGATCTGGAAGAAGTTGGGGAAAGAG 

AAAAATTGTATGGAGCATATTTAAACAAACAAACAAACAAACAAAAGGTTCATTTT 

GCCACAAAAAGGTGTGAATTAAATTAACCAGTTACGACTCTTAAAGAAAATATTCCC 

AATTATTCCCAGAGTTGCTATGTATGCTGTGCCTAGGACTTTGCTTGAACTGGCCCTA 

TAACTCTGGTGTGGTGTCTTTTCAGGATGCAGAAGAGAGGCAGGGAAGTCAGCTGCT 

TGCTGATCTCCCTCACTGCCATCTGCCTGGTGGTCACCCCTGGGAGCAGGGTCTGTC 

CTCGCCGATGTGCCTGCTATGTGCCCACAGAGGTGCACTGTACATTTCGGGACCTGA 

CCTCCATCCCAGACGGGCATCCCAGCCAATGTGGAACGAGTCAATTTAGGGTGTGTG 

GACCTTGCCTGATCTCCTTCTCAGAGAGGGACCACTGATTTTCCTGGTACTTTGCCCC 

CCAAACACCTGTGATTACTTTTAATAGTTTTCTTCTAAAATGGGTTCATACAAACCTT 

ATATTGTGGAGACAATGAACATTTTATCCCAATAGTCTTTTACTAGAACTTGAAGCC 

CCTCTTAGTTGTTTGGGAGCCTCATAATTATGGGGCAGCTTTATTCTGAATGAATTTT 

AAATGAAAAAGATACAGTTTCTGTTAACAATCATTATGATACCAAGGAAGAGGAAT 

TGTCATTGAATATTTTAAAAAAGCATTTCTTTTGCAATTTATAAATACCCATTACAAA 

ATGGCTTACTTAAAATACTTGCCTTACTAAATCTGACAAATTATGGTGATATTTTGAA 

GGTTTATGAAAATTTGTTTATGTGTATAAATGCACAAGAAATGGGATATGCCATCAC 

CTATGTGCCATTAGTGAGCATGTACAGTATGCCAAACACTATTGTTCACGTTTGGAG 

GAAGTAATGGGGGTGGGGGAGCAACAAGGGTTATAACCGTATACCCAGTGCCTTGG 

AAGCGATTGCAAACAGTAAAGACTGACATTGTGTTCTCCCTATGAGGGAGGGGCCTT 

GGGCTGAGCACTTTGCAATGAGCATTTGCTCATTGTGCTGGCAGGTTTTATGATAAC 

TTGACCCAAGCTAGAGTCACTGGAGAGGAAGGAACTTCAACTGAGAACATGCCTGA 

AGAAGATCAGATTATAGGCAGGCCTGTGGGGCATTTTCTTAATTAGTGATTCATGGG 

GCAGGGCCCAGTCCATTGTTCGTGGTACCATTTCTCAGGCACTATTAAAAAAAAAAA 

AACAGGCTGAGCAAGTGTCAAGGAGCAAGTCAGTGAGCAGCAGCCCTAATGATCTC 

TGCATCAGCTCCTGCCTCCAGGTTCCTACCCTATTTGAGTTCCTGTCCTAGCTCCCTA 

CAGTGATGAACAATGATGTGGAAGTATAAGCCAAATAAATCCTTTCTTCCCCAACTT 

GCTGTTGGTCATGATGTTTCATCACAGTGATAATAGTCCTCATGAAGATGCTGGTGT 

TTATAACACCTTTGGACTAAATTCTGTTATCTATAGCTGAGGAAAATGGAGCATAGA 

AAGTCTCCAGACTACACCAGAGTGTAATCTGGGCCTGAGCTTAGAATCACACCCAC 

GTGCACTCCACTGCCGGGGCTTCTTAACCGGAACACAGTTGTAAAAGGGAATTTTCT 

GTTTGTTTCCATTTTGACATGTGGACTTTAATTGACGATTCATCTGAAGCTGAAAATG 

ATTTTTTTTCCAGGTATAACAGCCTCACTAGATTGACAGAAAATGACTTTTCTGGCCT 

GAGCAGACTGGAGTTACTCATGCTGCACAGCAATGGCATTCACAGAGTCAGTGACA 

AGACCTTCTCGGGCTTGCAGTCCTTGCAGGTGAGATAGGTAGAGGGTGATGGAGGC 

TGAGAAGAGAGGTGCAACTGTGGGTTATACCCAAAAGCTGCTGATTCCCGTGGGAG 

ACATTCTATAAGCATTCTATAAACTAGAGGCAGATATCAAGGAAGGATTTCAATTGT 

AATGCAATTTTATGAGAAAATTTGAATATTAAGAAAATGCTGGGGAAAATGCTTAC 
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ACAATTGCGAGGACCTAATTTAGGATCTCCAATAGCCACATAAAAAGCACAGCATG 

GCGGCAGACACCTGCAATTCCTGTCCCTGGAAGCACCTGTTCAGAATCCCAGAGACT 

CATTGGCCAAACACTCTATTCAATCAATGAAGTCCATATTCAGTGACAAAACTTGAC 

TCAGAAACTAATGTGGAAAGCATCAGGAAGACAGCCAACATCTGGTCTCTACTCAT 

GCATGAATAAGGGATCCCAGAGAGAAGGGAAGAAAAAGGAAGGAAGGAAGGAAG 

GAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGAGAGGGAGGAAAGGAGGG 

AGGGAAGGAAGGAAAGGGAAAGGAAAAAAGAGATGGGGAGGGAGGGAAGGAAAG 

GAAAGGGGGAGAAAGAAGAGAAGAAAGGAAAATAAATAAATTTTCAGGGATTATT 

ACACCTTTAAATTTTATCCATAAAAGGTCATTTCCACCTGTTTGTCTGGAAGTAGAGT 

GGGATCCCTTATATAAGGGCAGTCTTTAACATAGTAGCATTTTATAAACCATTACAA 

ATTTTGAGTTTTCTCTACTTTTTATCCTCTACCATCTTCAAACTGAAACTACAAT^ 

CCCACAAATGAAGAAAATGCTGTAAGAGTTTTCACACACCGAAGTGGGAAACTTAA 

GGATTAGACAAGTCTAACAATGAGAATGGGGAGAACAAAAAGAGACTGCACAGGG 

AGCCCTTTCTCTGCTTATAATCTTGACACTTGAGAAGCTAATTGACGCTGCATGACTA 

CTCAACTCTTTAAGCAAACAATGCTGTTGTTCATGAAAAGCACAATAAAGTACATAT 

GTCCCATAATATTCATCAAAATTTGCATGCAGCACATAATAGCAATCAAAGCAATAA 

CACCCACTGTTCACAGAGACTTTAAACATGAAACTGGAACTATGTCTAGTGTTTTGA 

CTTAGGGTACATAGTATGCTGTGTCTGTATGTACCAATGTTGATTTAGGTCATCAGA 

CAGCATTTGGAACATGTATCTTCAGGAGGAATCATTCATGTATCCTGCATGAAATTC 

TCCACCTATGTTTATTCTCTTAGCCAGGTITITCTCTGATGGAGAAACATTGGGTTTG 

AGGTTTTACTCCCAGGTAACATTTAGGGAAAAGCTGTCTATGTTCTCAGTTTGGCTTT 

TATTTATGAGGGATGTTGGTATTCCAGAAAATTCTCTTTTGAAGAGATTACAATTTA 

GGTCAAAACAGAAAAATATGTAAAAAGTTATTGTTTTTATTAGTATTTCATGTTCTTT 

TCTTTTTTAAAAATGGTATGCTTAGAACTAATTAAGATTAGATTAGATTAGATTAGA 

AAATAATCAGAGAGGGATTTGATGAATGCTAAAGCATCATGAAAAATTCAAAATTT 

TTTGCTTCTAATTCAGAATCAATTAAATTCATATTACTATAAAAGACAGCACGCCAG 

ATGTGTGCCAGCTGAGGAGTGGATAAACTGTGTAACGTGAGTGCTATGTAGAAACA 

GAAAGGAGTGAAGGGTTGATGTGCGCTGCAACATCTTGAAAACATTCGGCTACATG 

ATGGAAGCCAGGCACAAAAAGCCACATATTGCATGGTTATGTTTATATGAAATGTTT 

AAAATACATGGATTCTTAGCAAACAGAGTAAGATGTTACTTAGGGTCAGGAAAAGA 

TTAAAAAAAAAAAAACTATTGATGTGGAATGATCTTAATTTGGGGAAAAGACAATT 

TCCTAAGACGAAATAGTTGAGGTAGATATAGTTATATCCCTGTGGATATTGTAATAA 

ACCAGCATGCTGTGCTCTGAGAAGGGCCTAATGAAGGGGCAGGAGGAAGTGAAATG 

AGATGGTAGAAAGGAAAGTCATATACCATGGCTTCTCTCGTGGGTGGAATCTAGAT 

ATGTTAATATATTGACATAAAGGAAGGAATTGTTTAGGGAAGGATCAAAACCAACA 

GGAGTGAGGGAGACAATAGGAACCAATGAGAGGCAAAGTTCATGGTCAATGTGTGT 

GGAGACACCATAATAAAACTCCTTTTTTGTTTGCTAACTAAAACCACTAAAATCTAA 

AAACAAAACATTTTTGCACAAGAATTATTTATTATTCAATAAAGATGTTTAAATGGG 

GGAAGTTGAAGTTCATTGATAGTCTCATAAATCTTAAATGTATTTAAACTGCTTTTTA 

CGTTTTTTATTATTAATTACTCTTGCTGTCATTATTATCATCATCATTATCGTCATCAT 
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CATCACTAATGCTTTTCACCATACACAAATGTAGGCAGAAGAGTGTAATCCACTTAG 

TGAGGCAATCTTGGAGAGGGAAAGGAAGCGGATGCGGGGCAGAGGCACACAGGAG 

GACAGTGAGAGGGAAATGAACAAGAAAAAATGTGGACACATGCACAAAAATTCCA 

TAGTCCACTACATTACTTTGTATTCTAATATTAAGAAAATAATAAACCCATTTCTGTG 

CACTTATCACCCAGGCTCAACAGTTATCTTGGCCACAGATCCTGTCTCACTGCATCCT 

GTCCACCTGAGTCCACTTAGCGTTCTGAATCCAATCCAGGGCATGATGCTTACTCCT 

ACACAGAACTAAAGATTAAAGAGAGTTTAAAAGTAACCATGACATCTCTCTGTTCCT 

TTAGCGATAAGTTCTTAATATTTATGGCTGCTTGTGTATGTTCTAATTTCTCTAATATT 

GTCACATTTAGTTGGCAACTACTTTGTTTGAATTGAGTTGGAGTTAAGGTCCCATAG 

GATTAATCTCAACATATTTCTATATTTA 

GAGAAAACAAATATGCCCATATCTTTCTTTACAGGTCTTAAAAATGAGCTATAACAA 

AGTCCAAATAATTGAGAAGGATACTTTGTATGGACTCAGGAGCTTGACCCGGTTGCA 

CCTGGATCACAACAACATTGAGTTTATCAACCCCGAGGCGTTTTACGGACTCACCTT 

GCTCCGCTTGGTACATCTAGAAGGAAACCGGCTGACAAAGCTCCATCCAGACACAT 

TTGTCTCTTTGAGCTATCTCCAGATATTTAAAACCTCCTTCATTAAGNACCTGTACTT 

GTATGATAACTTCATTGACCTCCCTCCCAAAAGAAATGGTCTCCTCTATGCCAAACC 

TAGAAAGCCTTTACTTGCATGGAAACCCATGGACCTGTGACTGCCATTTAAAGTGGT 



ATARKACGTATTTTCCTCAATTTCATTTAGAATGATATCCCAAAAGTCCCCCATAACC 



GGGGCATATAAAGTTTGCGTGTCCAATGGACCTCTCTTTCCAGTGATGGCCAACTAG 
GCCATCTTTTGATACATATGCAGCTAGAGTCAAGAGCTCTGGGGTACTGGTTAGTTC 
ATAATGTTGTTGCACCTACAGGGTTGAA (SEQ ID NO:20) 
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Figure 51 

MPKRAHWGALSVVLILLWGHPRVALACPHPCACYVPSEVHCTFRSLASVPAGIARHVE 

RINLGFNSIQALSETSFAGLTKLELLMIHGNEIPSIPDGALRDLSSLQVFKFSYNKLRVITG 

QTLQGLSNLMRLHIDHNKIEFIHPQAFNGLTSLRLLHLEGNLLHQLHPSTFSTFTFLDYFR 

LSTIRHLYLAENMVRTLPASMLRNMPLLENLYLQGNPWTCDCEMRWFLEWDAKSRGIL 

KCKKDKAYEGGQLCAMCFSPKKLYKHEIHKLKDMTCLKPSIESPLRQNRSRSIEEEQEQ 

EEDGGSQLILEKFQLPQWSISLNMTDEHGNMVNLVCDIKKPMDVYKIHLNQTDPPDIDIN 

ATVALDFECPMTRENYEKLWKLIAYYSEVPVKLHRELMLSKDPRVSYQYRQDADEEAL 

YYTGVRAQILAEPEWVMQPSIDIQLNRRQSTAKKVLLSYYTQYSQTISTKDTRQARGRS 

WVMIEPSGAVQRDQTVLEGGPCQLSCNVKASESPSIFWVLPDGSILKAPMDDPDSKFSIL 

SSGWLRIKSMEPSDSGLYQCIAQVRDEMDRMVYRVLVQSPSTQPAEKDTVTIGKNPGES 

VTLPCNALAIPEAHLSWILPNRRIINDLANTSHVYMLPNGTLSIPKVQVSDSGYYRCVAV 

NQQGADHFTVGITVTKKGSGLPSKRGRRPGAKALSRVREDIVEDEGGSGMGDEENTSR 

RLLHPKDQEVFLKTKDDAINGDKKAKKGRRKLKLWKHSEKEPETNVAEGRRVFESRRR 

INMANKQINPERWADILAKVRGKNLPKGTEVPPLIKTTSPPSLSLEVTPPFPAVSPPSASP 

VQTVTSAEESSADVPLLGEEEHVLGTISSASMGLEHNHNGVILVEPEVTSTPLEEVVDDL 

SEKTEEITSTEGDLKGTAAPTLISEPYEPSPTLHTLDTVYEKPTHEETATEGWSAADVGSS 

PEPTSSEYEPPLDAVSLAESEPMQYFDPDLETKSQPDEDKMKEDTFAHLTPTPTIWVNDS 

STSQLFEDSTIGEPGVPGQSHLQGLTDNIHLVKSSLSTQDTLLIKKGMKEMSQTLQGGNM 

LEGDPTHSRSSESEGQESKSITLPDSTLGIMSSMSPVKKPAETTVGTLLDKDTTTVTTTPR 

QKVAPSSTMSTHPSRRRPNGRRRLRPNKFRHRHKQTPPTTFAPSETFSTQPTQAPDIKISS 

QVESSLVPTAWVDNTVNTPKQLEMEKNAEPTSKGTPRRKHGKRPNKHRYTPSTVSSRA 

SGSKPSPSPENKHRNIVTPSSETILLPRTVSLKTEGPYDSLDYMTTTRKIYSSYPKVQETLP 

VTYKPTSDGKEIKDDVATNVDKHKSDILVTGESITNAIPTSRSLVSTMGEFKEESSPVGFP 

GTPTWNPSRTAQPGRLQTDIPVTTSGENLTDPPLLKELEDVDFTSEFLSSLTVSTPFHQEE 

AGSSTTLSSIKVEVASSQAETTTLDQDHLETTVAILLSETRPQNHTPTAARMKEPASSSPS 

TILMSLGQTTTTKPAIJSPRISQASRDSKENVFLNYVGNPETEATPVNNEGTQHMSGPNE 

LSTPSSDRDAFNLSTKLELEKQVFGSRSLPRGPDSQRQDGRVHASHQLTRVPAKPILPTA 

TVRLPEMSTQSASRYWTSQSPRHWTNKPEITTYPSGALPENKQFTTPRLSSTTIPLPLHM 

SKPSIPSKFTDRRTDQFNGYSKVFGNNNIPEARNPVGKPPSPRIPHYSNGRLPFFTNKTLSF 

PQLGVTRRPQIPTSPAPVMRERKVIPGSYNRfflSHSTFHIJ)FGPPAPPII.HTPQTTGSPSTN 

LQNIPMVSSTQSSISFITSSVQSSGSFHQSSSKFFAGGPPASKFWSLGEKPQILTKSPQTVSV 

TAETDTVFPCEATGKPKPFVTWTKVSTGALMTPNTRIQRFEVLKNGTLVIRKVQVQDRG 

QYMCTASNLHGLDRMVVLLSVTVQQPQILASHYQDVTVYLGDTIAMECLAKGTPAPQI 

SWIFPDRRVWQTVSPVESRITLHENRTLSIBGEASFSDRGVYKCVASNAAGADSLAIRLHV 

AALPPVfflQEKLEMSLPPGI^IfflHCTAKAAPLPSVRWVLGDGTQIRPSQFLHGNLFVFP 

NGTLYIRNLAPKDSGRYECVAANLVGSARRTVQLNVQRAAANARITGTSPRRTDVRYG 

GTLKLDCSASGDPWPR1LWRLPSKRMIDALFSFDSRIKVFANGTLVVKSVTDK 
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DAGDYLCVARNKVGDDYWLKVDVVMKPAKIEHKEENDHKVFYGGDLKVDCVATGL 

PNPEISWSLPDGSLVNSFMQSDDSGGRTKRYVVFNNGTLYFNEVGMREEGDYTCFAEN 

QVGKDEMRVRVKVWAPATIRNKTYLAVQVPYGDVVTVACEAKGEPMPKVTWLSPTN 

KVIPTSSEKYQIYQDGTLLIQKAQRSDSGNYTCLVRNSAGEDRKTVWfflVNVQPPKING 

NPNPITTVPvEIAAGGSRKLroCKAEGIPTPRVLWAFPEGVVLPAPYYGNRITVHGNGSLDI 

RSLRKSDSVQLVCMARNEGGEAPXIVQLTVLEPMEKPIFHDPISEmAMAGHTISLNCS 

AAGTPTPSLVWVLPNGTDLQSGQQLQRFYHKADGMLfflSGLSSVDAGAYRCVARNAA 

GHTERLVSLKVGLJCPEANKQYHNLVSIE^GETLKIJPCTPPGAGQGPvFSWTlJPNGMHLEG 

PQTLGRVSLIJDNGTLTVREASVFDRGTYVCRMETEYGPSVTSIPVmAYPPRrrSEPTPVI 

YTRPGNTVKLNCMAMG1PKADITWELPDKSHLKAGVQARLYGNRFLHPQGSLTIQHAT 

QRDAGFYKCMAKNILGSDSKTTYIHVF (SEQ ID NO: 21) 
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Figure 52 

ATGCCCAAGC GCGCGCACTG GGGGGCCCTC TCCGTGGTGC TGATCCTGCT 
TTGGGGCCAT CCGCGAGTGG CGCTGGCCTG CCCGCATCCT TGTGCCTGCT 
ACGTCCCCAG CGAGGTCCAC TGCACGTTCC GATCCCTGGC TTCCGTGCCC 
GCTGGCATTG CTAGACACGT GGAAAGAATC AATTTGGGGT TTAATAGCAT 
ACAGGCCCTG TCAGAAACCT CATTTGCAGG ACTGACCAAG TTGGAGCTAC 
TTATGATTCA CGGCAATGAG ATCCCAAGCA TCCCCGATGG AGCTTTAAGA 
GACCTCAGCT CTCTTCAGGT TTTCAAGTTC AGCTACAACA AGCTGAGAGT 
GATCACAGGA CAGACCCTCC AGGGTCTCTC TAACTTAATG AGGCTGCACA 
TTGACCACAA CAAGATCGAG TTTATCCACC CTCAAGCTTT CAACGGCTTA 
ACGTCTCTGA GGCTACTCCA TTTGGAAGGA AATCTCCTCC ACCAGCTGCA 
CCCCAGCACC TTCTCCACGT TCACATTTTT GGATTATTTC AGACTCTCCA 
CCATAAGGCA CCTCTACTTA GCAGAGAACA TGGTTAGAAC TCTTCCTGCC 
AGCATGCTTC GGAACATGCC GCTTCTGGAG AATCTTTACT TGCAGGGAAA 
TCCGTGGACC TGCGATTGTG AGATGAGATG GTTTTTGGAA TGGGATGCAA 
AATCCAGAGG AATTCTGAAG TGTAAAAAGG ACAAAGCTTA TGAAGGCGGT 
CAGTTGTGTG CAATGTGCTT CAGTCCAAAG AAGTTGTACA AACATGAGAT 
ACACAAGCTG AAGGACATGA CTTGTCTGAA GCCTTCAATA GAGTCCCCTC 
TGAGACAGAA CAGGAGCAGG AGTATTGAGG AGGAGCAAGA ACAGGAAGAG 
GATGGTGGCA GCCAGCTCAT CCTGGAGAAA TTCCAACTGC CCCAGTGGAG 
CATCTCTTTG AATATGACCG ACGAGCACGG GAACATGGTG AACTTGGTCT 
GTGACATCAA GAAACCAATG GATGTGTACA AGATTCACTT GAACCAAACG 
GATCCTCCAG ATATTGACAT AAATGCAACA GTTGCCTTGG ACTTTGAGTG 
TCCAATGACC CGAGAAAACT ATGAAAAGCT ATGGAAATTG ATAGCATACT 
ACAGTGAAGT TCCCGTGAAG CTACACAGAG AGCTCATGCT CAGCAAAGAC 
CCCAGAGTCA GCTACCAGTA CAGGCAGGAT GCTGATGAGG AAGCTCTTTA 
CTACACAGGT GTGAGAGCCC AGATTCTTGC AGAACCAGAA TGGGTCATGC 
AGCCATCCAT AGATATCCAG CTGAACCGAC GTCAGAGTAC GGCCAAGAAG 
GTGCTACTTT CCTACTACAC CCAGTATTCT CAAACAATAT CCACCAAAGA 
TACAAGGCAG GCTCGGGGCA GAAGCTGGGT AATGATTGAG CCTAGTGGAG 
CTGTGCAAAG AGATCAGACT GTCCTGGAAG GGGGTCCATG CCAGTTGAGC 
TGCAACGTGA AAGCTTCTGA GAGTCCATCT ATCTTCTGGG TGCTTCCAGA 
TGGCTCCATC CTGAAAGCGC CCATGGATGA CCCAGACAGC AAGTTCTCCA 
TTCTCAGCAG TGGCTGGCTG AGGATCAAGT CCATGGAGCC ATCTGACTCA 
GGCTTGTACC AGTGCATTGC TCAAGTGAGG GATGAAATGG ACCGCATGGT 
ATATAGGGTA CTTGTGCAGT CTCCCTCCAC TCAGCCAGCC GAGAAAGACA 
CAGTGACAAT TGGCAAGAAC CCAGGGGAGT CGGTGACATT GCCTTGCAAT 
GCTTTAGCAA TACCCGAAGC CCACCTTAGC TGGATTCTTC CAAACAGAAG 
GATAATTAAT GATTTGGCTA ACACATCACA TGTATACATG TTGCCAAATG 
GAACTCTTTC CATCCCAAAG GTCCAAGTCA GTGATAGTGG 
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TTACTACAGA TGTGTGGCTG TCAACCAGCA AGGGGCAGAC CATTTTACGG 
TGGGAATCAC AGTGACCAAG AAAGGGTCTG GCTTGCCATC CAAAAGAGGC 
AGACGCCCAG GTGCAAAGGC TCTTTCCAGA GTCAGAGAAG ACATCGTGGA 
GGATGAAGGG GGCTCGGGCA TGGGAGATGA AGAGAACACT TCAAGGAGAC 
TTCTGCATCC AAAGGACCAA GAGGTGTTCC TCAAAACAAA GGATGATGCC 
ATCAATGGAG ACAAGAAAGC CAAGAAAGGG AGAAGAAAGC TGAAACTCTG 
GAAGCATTCG GAAAAAGAAC CAGAGACCAA TGTTGCAGAA GGTCGCAGAG 
TGTTTGAATC TAGACGAAGG ATAAACATGG CAAACAAACA GATTAATCCG 
GAGCGCTGGG CTGATATTTT AGCCAAAGTC CGTGGGAAAA ATCTCCCTAA 
GGGCACAGAA GTACCCCCGT TGATTAAAAC CACAAGTCCT CCATCCTTGA 
GCCTAGAAGT CACACCACCT TTTCCTGCTG TTTCTCCCCC CTCAGCATCT 
CCTGTGCAGA CAGTAACCAG TGCTGAAGAA TCCTCAGCAG ATGTACCTCT 
ACTTGGTGAA GAAGAGCACG TTTTGGGTAC CATTTCCTCA GCCAGCATGG 
GGCTAGAACA CAACCACAAT GGAGTTATTC TTGTTGAACC TGAAGTAACA 
AGCACACCTC TGGAGGAAGT TGTTGATGAC CTTTCTGAGA AGACTGAGGA 
GATAACTTCC ACTGAAGGAG ACCTGAAGGG GACAGCAGCC CCTACACTTA 
TATCTGAGCC TTATGAACCA TCTCCTACTC TGCACACATT AGACACAGTC 
TATGAAAAGC CCACCCATGA AGAGACGGCA ACAGAGGGTT GGTCTGCAGC 
AGATGTTGGA TCGTCACCAG AGCCCACATC CAGTGAGTAT GAGCCTCCAT 
TGGATGCTGT CTCCTTGGCT GAGTCTGAGC CCATGCAATA CTTTGACCCA 
GATTTGGAGA CTAAGTCACA ACCAGATGAG GATAAGATGA AAGAAGACAC 
CTTTGCACAC CTTACTCCAA CCCCCACCAT CTGGGTTAAT GACTCCAGTA 
CATCACAGTT ATTTGAGGAT TCTACTATAG GGGAACCAGG TGTCCCAGGC 
CAATCACATC TACAAGGACT GACAGACAAC ATCCACCTTG TGAAAAGTAG 
TCTAAGCACT CAAGACACCT TACTGATTAA AAAGGGTATG AAAGAGATGT 
CTCAGACACT ACAGGGAGGA AATATGCTAG AGGGAGACCC CACACACTCC 
AGAAGTTCTG AGAGTGAGGG CCAAGAGAGC AAATCCATCA CTTTGCCTGA 
CTCCACACTG GGTATAATGA GCAGTATGTC TCCAGTTAAG AAGCCTGCGG 
AAACCACAGT TGGTACCCTC CTAGACAAAG ACACCACAAC AGTAACAACA 
ACACCAAGGC AAAAAGTTGC TCCGTCATCC ACCATGAGCA CTCACCCTTC 
TCGAAGGAGA CCCAACGGGA GAAGGAGATT ACGCCCCAAC AAATTCCGCC 
ACCGGCACAA GCAAACCCCA CCCACAACTT TTGCCCCATC AGAGACTTTT 
TCTACTCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA GTCAAGTGGA 
GAGTTCTCTG GTTCCTACAG CTTGGGTGGA TAACACAGTT AATACCCCCA
AACAGTTGGA AATGGAGAAG AATGCAGAAC CCACATCCAA GGGAACACCA 
CGGAGAAAAC ACGGGAAGAG GCCAAACAAA CATCGATATA CCCCTTCTAC 
AGTGAGCTCA AGAGCGTCCG GATCCAAGCC CAGCCCTTCT CCAGAAAATA 
AACATAGAAA CATTGTTACT CCCAGTTCAG AAACTATACT TTTGCCTAGA 
ACTGTTTCTC TGAAAACTGA GGGCCCTTAT GATTCCTTAG ATTACATGAC 
AACCACCAGA AAAATATATT CATCTTACCC TAAAGTCCAA GAGACACTTC 
CAGTCACATA 
TAAACCCACA TCAGATGGAA AAGAAATTAA GGATGATGTT GCCACAAATG
TTGACAAACA TAAAAGTGAC ATTTTAGTCA CTGGTGAATC AATTACTAAT 
GCCATACCAA CTTCTCGCTC CTTGGTCTCC ACTATGGGAG AATTTAAGGA 
AGAATCCTCT CCTGTAGGCT TTCCAGGAAC TCCAACCTGG AATCCCTCAA 
GGACGGCCCA GCCTGGGAGG CTACAGACAG ACATACCTGT TACCACTTCT 
GGGGAAAATC TTACAGACCC TCCCCTTCTT AAAGAGCTTG AGGATGTGGA 
TTTCACTTCC GAGTnTTGT CCTCTTTGAC AGTCTCCACA CCATTTCACC 
AGGAAGAAGC TGGTTCTTCC ACAACTCTCT CAAGCATAAA AGTGGAGGTG 
GCTTCAAGTC AGGCAGAAAC CACCACCCTT GATCAAGATC ATCTTGAAAC 
CACTGTGGCT ATTCTCCTTT CTGAAACTAG ACCACAGAAT CACACCCCTA 
CTGCTGCCCG GATGAAGGAG CCAGCATCCT CGTCCCCATC CACAATTCTC 
ATGTCTTTGG GACAAACCAC CACCACTAAG CCAGCACTTC CCAGTCCAAG 
AATATCTCAA GCATCTAGAG ATTCCAAGGA AAATGTTTTC TTGAATTATG 
TGGGGAATCC AGAAACAGAA GCAACCCCAG TCAACAATGA AGGAACACAG 
CATATGTCAG GGCCAAATGA ATTATCAACA CCCTCTTCCG ACCGGGATGC 
ATTTAACTTG TCTACAAAGC TGGAATTGGA AAAGCAAGTA TTTGGTAGTA 
GGAGTCTACC ACGTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTTCAT 
GCTTCTCATC AACTAACCAG AGTCCCTGCC AAACCCATCC TACCAACAGC 
AACAGTGAGG CTACCTGAAA TGTCCACACA AAGCGCTTCC AGATACTTTG 
TAACTTCCCA GTCACCTCGT CACTGGACCA ACAAACCGGA AATAACTACA 
TATCCTTCTG GGGCTTTGCC AGAGAACAAA CAGTTTACAA CTCCAAGATT 
ATCAAGTACA ACAATTCCTC TCCCATTGCA CATGTCCAAA CCCAGCATTC 
CTAGTAAGTT TACTGACCGA AGAACTGACC AATTCAATGG TTACTCCAAA 
GTGTTTGGAA ATAACAACAT CCCTGAGGCA AGAAACCCAG TTGGAAAGCC 
TCCCAGTCCA AGAATTCCTC ATTATTCCAA TGGAAGACTC CCTTTCTTTA 
CCAACAAGAC TCTTTCTTTT CCACAGTTGG GAGTCACCCG GAGACCCCAG 
ATACCCACTT CTCCTGCCCC AGTAATGAGA GAGAGAAAAG TTATTCCAGG
TTCCTACAAC AGGATACATT CCCATAGCAC CTTCCATCTG GACTTTGGCC 
CTCCGGCACC TCCGTTGTTG CACACTCCGC AGACCACGGG ATCACCCTCA 
ACTAACTTAC AGAATATCCC TATGGTCTCT TCCACCCAGA GTTCTATCTC 
CTTTATAACA TCTTCTGTCC AGTCCTCAGG AAGCTTCCAC CAGAGCAGCT 
CAAAGTTCTT TGCAGGAGGA CCTCCTGCAT CCAAATTCTG GTCTCTTGGG 
GAAAAGCCCC AAATCCTCAC CAAGTCCCCA CAGACTGTGT CCGTCACCGC 
TGAGACAGAC ACTGTGTTCC CCTGTGAGGC AACAGGAAAA CCAAAGCCTT 
TCGTTACTTG GACAAAGGTT TCCACAGGAG CTCTTATGAC TCCGAATACC 
AGGATACAAC GGTTTGAGGT TCTCAAGAAC GGTACCTTAG TGATACGGAA 
GGTTCAAGTA CAAGATCGAG GCCAGTATAT GTGCACCGCC AGCAACCTGC
ACGGCCTGGA CAGGATGGTG GTCTTGCTTT CGGTCACCGT GCAGCAACCT 
CAAATCCTAG CCTCCCACTA CCAGGACGTC ACTGTCTACC TGGGAGACAC 
CATTGCAATG GAGTGTCTGG CCAAAGGGAC CCCAGCCCCC CAAATTTCCT 
GGATCTTCCC TGACAGGAGG GTGTGGCAAA CTGTGTCCCC 
CGTGGAGAGC CGCATCACCC TGCACGAAAA CCGGACCCTT TCCATCAAGG
AGGCGTCCTT CTCAGACAGA GGCGTCTATA AGTGCGTGGC CAGCAATGCA 
GCCGGGGCGG ACAGCCTGGC CATCCGCCTG CACGTGGCGG CACTGCCCCC 
CGTTATCCAC CAGGAGAAGC TGGAGAACAT CTCGCTGCCC CCGGGGCTCA 
GCATTCACAT TCACTGCACT GCCAAGGCTG CGCCCCTGCC CAGCGTGCGC 
TGGGTGCTCG GGGACGGTAC CCAGATCCGC CCCTCGCAGT TCCTCCACGG 
GAACTTGTTT GTTTTCCCCA ACGGGACGCT CTACATCCGC AACCTCGCGC 
CCAAGGACAG CGGGCGCTAT GAGTGCGTGG CCGCCAACCT GGTAGGCTCC 
GCGCGCAGGA CGGTGCAGCT GAACGTGCAG CGTGCAGCAG CCAACGCGCG 
CATCACGGGC ACCTCCCCGC GGAGGACGGA CGTCAGGTAC GGAGGAACCC 
TCAAGCTGGA CTGCAGCGCC TCGGGGGACC CCTGGCCGCG CATCCTCTGG 
AGGCTGCCGT CCAAGAGGAT GATCGACGCG CTCTTCAGTT TTGATAGCAG 
AATCAAGGTG TTTGCCAATG GGACCCTGGT GGTGAAATCA GTGACGGACA 
AAGATGCCGG AGATTACCTG TGCGTAGCTC GAAATAAGGT TGGTGATGAC 
TACGTGGTGC TCAAAGTGGA TGTGGTGATG AAACCGGCCA AGATTGAACA 
CAAGGAGGAG AACGACCACA AAGTCTTCTA CGGGGGTGAC CTGAAAGTGG 
ACTGTGTGGC CACCGGGCTT CCCAATCCCG AGATCTCCTG GAGCCTCCCA 
GACGGGAGTC TGGTGAACTC CTTCATGCAG TCGGATGACA GCGGTGGACG 
CACCAAGCGC TATGTCGTCT TCAACAATGG GACACTCTAC TTTAACGAAG 
TGGGGATGAG GGAGGAAGGA GACTACACCT GCTTTGCTGA AAATCAGGTC 
GGGAAGGACG AGATGAGAGT CAGAGTCAAG GTGGTGACAG CGCCCGCCAC
CATCCGGAAC AAGACTTACT TGGCGGTTCA GGTGCCCTAT GGAGACGTGG 
TCACTGTAGC CTGTGAGGCC AAAGGAGAAC CCATGCCCAA GGTGACTTGG 
TTGTCCCCAA CCAACAAGGT GATCCCCACC TCCTCTGAGA AGTATCAGAT 
ATACCAAGAT GGCACTCTCC TTATTCAGAA AGCCCAGCGT TCTGACAGCG 
GCAACTACAC CTGCCTGGTC AGGAACAGCG CGGGAGAGGA TAGGAAGACG 
GTGTGGATTC ACGTCAACGT CCAGCCACCC AAGATCAACG GTAACCCCAA 
CCCCATCACC ACCGTGCGGG AGATAGCAGC CGGGGGCAGT CGGAAACTGA 
TTGACTGCAA AGCTGAAGGC ATCCCCACCC CGAGGGTGTT ATGGGCTTTT 
CCCGAGGGTG TGGTTCTGCC AGCTCCATAC TATGGAAACC GGATCACTGT 
CCATGGCAAC GGTTCCCTGG ACATCAGGAG TTTGAGGAAG AGCGACTCCG 
TCCAGCTGGT ATGCATGGCA CGCAACGAGG GAGGGGAGGC GAGGTTGATC 
GTGCAGCTCA CTGTCCTGGA GCCCATGGAG AAACCCATCT TCCACGACCC 
GATCAGCGAG AAGATCACGG CCATGGCGGG CCACACCATC AGCCTCAACT 
GCTCTGCCGC GGGGACCCCG ACACCCAGCC TGGTGTGGGT CCTTCCCAAT 
GGCACCGATC TGCAGAGTGG ACAGCAGCTG CAGCGCTTCT ACCACAAGGC 
TGACGGCATG CTACACATTA GCGGTCTCTC CTCGGTGGAC GCTGGGGCCT 
ACCGCTGCGT GGCCCGCAAT GCCGCTGGCC ACACGGAGAG GCTGGTCTCC 
CTGAAGGTGG GACTGAAGCC AGAAGCAAAC AAGCAGTATC ATAACCTGGT 
CAGCATCATC AATGGTGAGA CCCTGAAGCT CCCCTGCACC CCTCCCGGGG 
CTGGGCAGGG ACGTTTCTCC TGGACGCTCC CCAATGGCAT GCATCTGGAG 
GGCCCCCAAA CCCTGGGACG CGTTTCTCTT CTGGACAATG GCACCCTCAC
GGTTCGTGAG GCCTCGGTGT TTGACAGGGG TACCTATGTA TGCAGGATGG 
AGACGGAGTA CGGCCCTTCG GTCACCAGCA TCCCCGTGAT TGTGATCGCC 
TATCCTCCCC GGATCACCAG CGAGCCCACC CCGGTCATCT ACACCCGGCC 
CGGGAACACC GTGAAACTGA ACTGCATGGC TATGGGGATT CCCAAAGCTG 
ACATCACGTG GGAGTTACCG GATAAGTCGC ATCTGAAGGC AGGGGTTCAG 
GCTCGTCTGT ATGGAAACAG ATTTCTTCAC CCCCAGGGAT CACTGACCAT 
CCAGCATGCC ACACAGAGAG ATGCCGGCTT CTACAAGTGC ATGGCAAAAA
ACATTCTCGG CAGTGACTCC AAAACAACTT ACATCCACGT CTTCTGAAAT 
GTGGATTCCA GAATGATTGC TTAGGAACTG ACAACAAAGC GGGGTTTGTA 
AGGGAAGCCA GGTTGGGGAA TAGGAGCTCT TAAATAATGT GTCACAGTGC 
ATGGTGGCCT CTGGTGGGTT TCAAGTTGAG GTTGATCTTG ATCTACAATT 
GTTGGGAAAA GGAAGCAATG CAGACACGAG AAGGAGGGCT CAGCCTTGCT 
GAGACACTTT CTTTTGTGTT TACATCATGC CAGGGGCTTC ATTCAGGGTG 
TCTGTGCTCT GACTGCAATT TTTCTTCTTT TGCAAATGCC ACTCGACTGC 
CTTCATAAGC GTCCATAGGA TATCTGAGGA ACATTCATCA AAAATAAGCC 
ATAGACATGA ACAACACCTC ACTACCCCAT TGAAGACGCA TCACCTAGTT 
AACCTGCTGC AGTTTTTACA TGATAGACTT TGTTCCAGAT TGACAAGTCA 
TCTTTCAGTT ATTTCCTCTG TCACTTCAAA ACTCCAGCTT GCCCAATAAG 
GATTTAGAAC CAGAGTGACT GATATATATA TATATATTTT AATTCAGAGT 
TACATACATA CAGCTACCAT TTTATATGAA AAAAGAAAAA CATTTCTTCC 
TGGAACTCAC TTTTTATATA ATGTTTTATA TATATATTTT TTCCTTTCAA 
ATCAGACGAT GAGACTAGAA GGAGAAATAC TTTCTGTCTT ATTAAAATTA 
ATAAATTATT GGTCTTTACA AGACTTGGAT ACATTACAGC AGACATGGAA 
ATATAATTTT AAAAAATTTC TCTCCAACCT CCTTCAAATT CAGTCACCAC 
TGTTATATTA CCTTCTCCAG GAACCCTCCA GTGGGGAAGG CTGCGATATT 
AGATTTCCTT GTATGCAAAG TTTTTGTTGA AAGCTGTGCT CAGAGGAGGT 
GAGAGGAGAG GAAGGAGAAA ACTGCATCAT AACTTTACAG AATTGAATCT 
AGAGTCTTCC CCGAAAAGCC CAGAAACTTC TCTGCAGTAT CTGGCTTGTC 
CATCTGGTCT AAGGTGGCTG CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 
CCATGAATAA TACACGACCT GTTATTTCCA TGACTGCTTT ACTGTATTTT 
TAAGGTCAAT ATACTGTACA TTTGATAATA AAATAATATT CTCCCAAAAA AAAAA 
Figure 5z
Human OCP: nucleotide sequence of ORF



ATGAAGGTAAAAGGCAGAGGAATCACCTGCTTGCTGGTCTCCTTTGCTGTGATCTGC 

CTGGTCGCCACCCCTGGGGGCAAGGCCTGTCCTCGCCGCTGTGCCTGTTATATGCCT 

ACGGAGGTACACTGCACATTTCGGTACCTGACTTCCATCCCAGACAGCATCCCGCCC 

AATGTGGAACGCATCAATTTAGGATACAACAGCTTGGTTAGATTGATGGAAACAGAT 

TTTTCTGGCCTGACCAAACTGGAGTTACTCATGCTTCACAGCAATGGCATTCACACA 

ATCCCTGACAAGACCTTCTCAGATTTGCAGGCCTTGCAGGTCTTAAAAATGAGCTAT 

AATAAAGTCCGAAAACTTCAGAAAGATACTTTTTATGGCCTCAGGAGCTTGACACGA 

TTGCACATGGACCACAACAATATTGAGTTTATAAACCCAGAGGTTTTTTATGGGCTC

AACTTTCTCCGCCTGGTGCACTTGGAAGGAAATCAGCTCACTAAGCTCCACCCAGAT 

ACATTTGTCTCTTTGAGCTACCTCCAGATATTTAAAATCTCTTTCATTAAGTTCCTA 

TACTTGTCTGATAACTTCCTGACCTCCCTCCCTCAAGAGATGGTCTCCTATATGCCT 

GACCTAGACAGCCTTTACCTGCATGGAAACCCATGGACCTGTGATTGCCATTTAAAG 

TGGTTGTCTGACTGGATACAGGAGAAGCCAGATGTAATAAAATGCAAAAAAGATAGA 

AGTCCCTCTAGTGCTCAGCAGTGTCCACTTTGCATGAACCCTAGGACTTCTAAAGGC 

AAGCCGTTAGCTATGGTCTCAGCTGCAGCTTTCCAGTGTGCCAAGCCAACCATTGAC 

TCATCCCTGAAATCAAAGAGCCTGACTATTCTGGAAGACAGTAGTTCTGCTTTCATC 

TCTCCCCAAGGTTTCATGGCACCCTTTGGCTCCCTCACTTTGAATATGACAGATCAG 

TCTGGAAATGAAGCTAACATGGTCTGCAGTATTCAAAAGCCCTCAAGGACATCACCC 

ATTGCATTCACTGAAGAAAATGACTACATCGTGCTAAATACTTCATTTTCAACATTT 

TTGGTGTGCAACATAGATTACGGTCACATTCAGCCAGTGTGGCAAATTTTGGCTTTG 

TACAGTGATTCTCCTCTGATACTAGAAAGGAGCCACTTGCTTAGTGAAACACCGCAG 

CTCTATTACAAATATAAACAGGTGGCTCCTAAGCCTGAAGACATTTTTACCAACATA 

GAGGCAGATCTCAGAGCAGATCCCTCTTGGTTAATGCAAGACCAAATTTCCTTGCAG 

CTGAACAGAACTGCCACCACATTCAGTACATTACAGATCCAGTACTCCAGTGATGCT 

CAAATCACTTTACCAAGAGCAGAGATGAGGCCAGTGAAACACAAATGGACTATGATT

TCAAGGGATAACAATACTAAGCTGGAACATACTGTCTTGGTAGGTGGAACCGTTGGC 

CTGAACTGCCCAGGCCAAGGAGACCCCACCCCACACGTGGATTGGCTTCTAGCTGAT 

GGAAGTAAAGTGAGAGCCCCTTATGTCAGTGAGGATGGACGGATCCTAATAGACAAA 

AGTGGAAAATTGGAACTCCAGATGGCTGATAGTTTTGACACAGGCGTATATCACTGT 

ATAAGCAGCAATTATGATGATGCAGATATTCTCACCTATAGGATAACTGTGGTAGAA 

CCTTTGGTCGAAGCCTATCAGGAAAATGGGATTCATCACACAGTTTTCATTGGTGAA 

ACACTTGATCTTCCATGCCATTCTACTGGTATCCCAGATGCCTCTATTAGCTGGGTT 

ATTCCAGGAAACAATGTGCTCTATCAGTCATCAAGAGACAAGAAAGTTCTAAACAAT 

GGCACATTAAGAATATTACAGGTCACCCCGAAAGACCAAGGTTATTATCGCTGTGTG 

GCAGCCAACCCATCAGGGGTTGATTTTTTGATTTTCCAAGTTTCAGTCAAGATGAAA 

GGACAAAGGCCCTTGGAGCATGATGGAGAAACAGAGGGATCTGGACTTGATGAGTCC 

AATCCTATTGCTCATCTTAAGGAGCCACCAGGTGCACAACTCCGTACATCTGCTCTG 

ATGGAGGCTGAGGTTGGAAAACACACCTCAAGCACAAGTAAGAGGCACAACTATCGG 

GAATTAACACTCCAGCGACGTGGAGATTCAACACATCGACGTTTTAGGGAGAATAGG 

AGGCATTTCCCTCCCTCTGCTAGGAGAATTGACCCACAACATTGGGCGGCACTGTTG 

GAGAAAGCTAAAAAGAATGCTATGCCAGACAAGCGAGAAAATACCACAGTGAGCCCA 

CCCCCAGTGGTCACCCAACTCCCAAACATACCTGGTGAAGAAGACGATTCCTCAGGC 

ATGCTCGCTCTACATGAGGAATTTATGGTCCCGGCCACTAAAGCTTTGAACCTTCCA 

GCAAGGACAGTGACTGCTGACTCCAGAACAATATCTGATAGTCCTATGACAAACATA 

AATTATGGCACAGAATTCTCTCCTGTTGTGAATTCACAAATACTACCACCTGAAGAA 

CCCACAGATTTCAAACTGTCTACTGCTATTAAAACTACAGCCATGTCAAAGAATATA 



AACCCAACCATGTCAAGCCAAATACAAGGCACAACCAATCAACATTCATCCACTGTC 

TTTCCACTGCTACTTGGAGCAACTGAATTTCAGGACTCTGACCAGATGGGAAGAGGA 

AGAGAGCATTTCCAAAGTAGACCCCCAATAACAGTAAGGACTATGATCAAAGATGTC 

AATGTCAAAATGCTTAGTAGCACCACCAACAAACTATTATTAGAGTCAGTAAATACC 

ACAAATAGTCATCAGACATCTGTAAGAGAAGTGAGTGAACCCAGGCACAATCACTTC 

TATTCTCACACTACTCAAATACTTAGCACCTCCACGTTCCCTTCAGATCCACACACA 

GCTGCTCATTCTCAGTTTCCGATCCCTAGAAATAGTACAGTTAACATCCCGCTGTTC 

AGACGCTTTGGGAGGCAGAGGAAAATTGGCGGAAGGGGGCGGATTATCAGCCCATAT 

AGAACTCCAGTTCTGCGACGGCATAGATACAGCATTTTCAGGTCAACAACCAGAGGT 

TCTTCTGAAAAAAGCACTACTGCATTCTCAGCCACAGTGCTCAATGTGACATGTCTG 

TCCTGTCTTCCCAGGGAGAGGCTCACCACTGCCACAGCAGCATTGTCTTTTCCAAGT 

GCTGCTCCCATCACCTTCCCCAAAGCTGACATTGCTAGAGTCCCATCAGAAGAGTCT 

ACAACTCTAGTCCAGAATCCACTATTACTACTTGAGAACAAACCCAGTGTAGAGAAA 

ACAACACCCACAATAAAATATTTCAGGACTGAAATTTCCCAAGTGACTCCAACTGGT 

GCAGTCATGACATATGCTCCAACATCCATACCCATGGAAAAAACTCACAAAGTAAAC 

GCCAGTTACCCACGTGTGTCTAGCACCAATGAAGCTAAAAGAGATTCAGTGATTACA 

TCGTCACTTTCAGGTGCTATCACCAAGCCACCAATGACTATTATAGCCATTACAAGG 

TTTTCAAGAAGGAAAATTCCCTGGCAACAGAACTTTGTAAATAACCATAACCCAAAA 

GGCAGATTAAGGAATCAACATAAAGTTAGTTTACAAAAAAGCACAGCTGTGATGCTT

CCTAAAACATCTCCTGCTTTACCACAGAGACAAAGTTCCCCTTTCCATTTCACCACA 

CTTTCAACAAGTGTGATGCAAATTCCATCTAATACCTTGACTACCGCTCACCACACT 

ACGACCAAAACACACAATCCTGGAAGTCTTCCAACAAAGAAGGAGCTTCCCTTCCCA 

CCCCTTAACCCTATGCTTCCTAGTATTATAAGCAAAGACTCAAGTACAAAAAGCATC 

ATATCAACGCAAACAGCAATACCAGCAACAACTCCTACCTTCCCTGCATCTGTCATC 

ACTTATGAAACCCAAACAGAGAGATCTAGAGCACAAACAATACAAAGAGAACAGGAG 

CCTCAAAAGAAGAACAGGACTGACCCAAACATCTCTCCAGACCAGAGTTCTGGCTTC 

ACTACACCCACTGCTATGACACCTCCTGCTCTGGCATTCACTCATTCCCCACCAGAA 

AACACAACTGGGATTTCAAGCACAATCAGTTTTCATTCAAGAACTCTTAATCTGACA 

GATGTGATTGAAGAACTAGCCCAAGCAAGTACTCAGACTTTGAAGAGCACAATTGCT 

TCTGAAACAACTTTGTCCAGCAAATCACACCAGAGTACCACAACTAGGAAAGCATCA 

TTAGACACTCCCATACCACCATTCTTGAGCAGCAGTGCTACTCTAATGCCAGTTCCC 

ATCTCCCCTCCCTTTACTCAGAGAGCAGTTACTGACACACGTGGCGACTCCCATTTC 

CGGCTTATGACAAATACAGTGGTCAAGCTGCACGAATCCTCAAGGCACAATCTCCAA 

ATGCCAAGTTCACAATTGGAACCACTCACTTCATCTACCTCTAATCTGTTACATTCT 

ACTCCCATGCCAGCACTAACAACAGTTAAATCACAGAATTCCAAATTAACTCCATCT 

CCCTGGGCAGAATACCAATTTTGGCACAAACCATACTCAGACATTGCTGAAAAAGGC 

AAAAAGCCAGAAGTAAGCATGTTGGCTACTACAGGCCTGTCCGAGGCCACCACTCTT 

GTTTCAGATTGGGATGGACAGAAGAACACAAAGAAGAGTGACTTTGATAAGAAACCA 

GTTCAAGAAGCAACAACTTCCAAACTCCTTCCCTTTGACTCTTTGTCTAGGTATATA 

TTTGAAAAGCCCAGGATAGTTGGAGGAAAAGCTGCAAGTTTTACTATTCCAGCTAAC 

TCAGATGCCTTTCTTCCCTGTGAAGCTGTTGGAAATCCCCTGCCCACCATTCATTGG 

ACCAGAGTTTCAGGACTTGATTTATCTAGAGGAAACCAGAATAGCAGGGTCCAGGTT 

CTCCCCAATGGTACCCTGTCCATCCAGAGGGTGGAAATTCAGGACCGCGGACAGTAC 

TTGTGTTCCGCATCCAATCTGTTTGGCACAGACCACCTTCATGTCACCTTGTCTGTG 

GTTTCCTATCCTCCCAGGATCCTGGAGAGACGTACCAAAGAGATCACAGTTCATTCC 

GGAAGCACTGTGGAACTGAAGTGCAGAGCAGAAGGTAGGCCAAGCCCTACAGTTACC 

TGGATTCTTGCAAACCAAACAGTTGTCTCAGAATCATCCCAGGGAAGTAGGCAGGCT 

GTGGTGACGGTTGACGGAACATTGGTCCTCCACAATCTCAGTATTTATGACCGTGGC 

TTTTACAAATGTGTGGCCAGCAACCCAGGTGGCCAGGATTCACTGCTGGTTAAAATA 

CAAGTCATTGCAGCACCACCTGTTATTCTAGAGCAAAGGAGGCAAGTCATTGTAGGC 

ACTTGGGGTGAAAGTTTAAAACTGCCCTGTACTGCAAAAGGAACTCCTCAGCCCAGC 



GTTTACTGGGTCCTCTCTGATGGCACTGAAGTGAAACCATTACAGTTTACCAATTCC 
AAGTTGTTCTTATTTTCAAATGGGACTTTGTATATAAGAAACCTAGCCTCTTCAGAC 
AGGGGCACTTATGAATGCATTGCTACCAGTTCCACTGGTTCGGAGCGAAGAGTAGTA 
ATGCTTACAATGGAAGAGCGAGTGACCAGCCCCAGGATAGAAGCTGCATCCCAGAAA 
AGGACTGAAGTGAATTTTGGGGACAAATTACTACTGAACTGCTCAGCCACTGGGGAG 
CCCAAACCCCAAATAATGTGGAGGTTACCATCCAAGGCTGTGGTCGACCAGTGGAGC 
TGGATCCACGTCTACCCTAATGGATCCCTGTTTATTGGATCAGTAACAGAAAAAGAC 
AGTGGTGTCTACTTGTGTGTGGCAAGAAACAAAATGGGGGATGATCTGATACTGATG 
CATGTTAGCCTAAGACTGAAACCTGCCAAAATTGACCACAAGCAGTATTTTAGAAAG 
CAAGTGCTCCATGGGAAAGATTTCCAAGTAGATTGCAAAGCTTCCGGCTCCCCAGTG 
CCAGAGATATCTTGGAGTTTGCCTGATGGAACCATGATCAACAATGCAATGCAAGCC 
GATGACAGTGGCCACAGGACTAGGAGATATACCCTTTTCAACAATGGAACTTTATAC 
TTCAACAAAGTTGGGGTAGCGGAGGAAGGAGATTATACTTGCTATGCCCAGAACACC 
CTAGGGAAAGATGAAATGAAGGTCCACTTAACAGTTATAACAGCTGCTCCCCGGATA 
AGGCAGAGTAACAAAACCAACAAGAGAATCAAAGCTGGAGACACAGCTGTCCTTGAC 
TGTGAGGTCACTGGGGATCCCAAACCAAAAATATTTTGGTTGCTGCCTTCCAATGAC 
ATGATTTCCTTCTCCATTGATAGGTACACATTTCATGCCAATGGGTCTTTGACCATC 
AACAAAGTGAAACTGCTCGATTCTGGAGAGTACGTATGTGTAGCCCGAAATCCCAGT 
GGGGATGACACCAAAATGTACAAACTGGATGTGGTCTCTAAACCTCCATTAATCAAT 
GGTCTGTATACAAACAGAACTGTTATTAAAGCCACAGCTGTGAGACATTCCAAAAAA 
CACTTTGACTGCAGAGCTGAAGGGACACCATCTCCTGAAGTCATGTGGATCATGCCA 
GACAATATTTTCCTCACAGCCCCATACTATGGAAGCAGAATCACAGTCCATAAAAAT 
GGAACCTTGGAAATTAGGAATGTGAGGCTTTCAGATTCAGCCGACTTTATCTGTGTG 
GCCCGAAATGAAGGTGGAGAGAGCGTGTTGGTAGTACAGTTAGAAGTACTGGAAATG 
CTGAGAAGACCGACATTTAGAAATCCATTTAATGAAAAAATAGTTGCCCAGCTGGGA 
AAGTCCACAGCATTGAATTGCTCTGTTGATGGTAACCCACCACCTGAAATAATCTGG 
ATTTTACCAAATGGCACACGATTTTCCAATGGACCACAAAGTTATCAGTATCTGATA
GCAAGCAATGGTTCTTTTATCATTTCTAAAACAACTCGGGAGGATGCAGGAAAATAT 
CGCTGTGCAGCTAGGAATAAAGTTGGCTATATTGAGAAATTAGTCATATTAGAAATT 
GGCCAGAAGCCAGTTATTCTTACCTATGCACCAGGGACAGTAAAAGGCATCAGTGGA 
GAATCTCTATCACTGCATTGTGTGTCTGATGGAATCCCTAAGCCAAATATCAAATGG 
ACTATGCCAAGTGGTTATGTAGTAGACAGGCCTCAAATTAATGGGAAATACATATTG 
CATGACAATGGCACCTTAGTCATTAAAGAAGCAACAGCTTATGACAGAGGAAACTAT 
ATCTGTAAGGCTCAAAATAGTGTTGGTCATACACTGATTACTGTTCCAGTAATGATT 
GTAGCCTACCCTCCCCGAATTACAAATCGTCCACCCAGGAGTATTGTCACCAGGACA 
GGGGCAGCCTTTCAGCTCCACTGTGTGGCCTTGGGAGTTCCCAAGCCAGAAATCACA 
TGGGAGATGCCTGACCACTCCCTTCTCTCAACGGCAAGTAAAGAGAGGACACATGGA 
AGTGAGCAGCTTCACTTACAAGGTACCCTAGTCATTCAGAATCCCCAAACCTCCGAT 
TCTGGGATATACAAATGCACAGCAAAGAACCCACTTGGTAGTGATTATGCAGCAACG 
TATATTCAAGTAATCTGA 



Figure 5ft 

Human OCP: predicted amino acid sequence 



MKVKGRGITC 

IPDSIPPNVE 

FSDLQALQVL 

GLNFLRLVHL 

PQEMVSYMPD 

SAQQCPLCMN 

SAFISPQGFM 

YIVLNTSFST 

YYKYKQVAPK 
LTYAPGTVKG ISGESLSLHC VSDGIPKPNI KWTMPSGYVV DRPQINGKYI
LHDNGTLVIK EATAYDRGNY ICKAQNSVGH TLITVPVMIV AYPPRITNRP 
PRSIVTRTGA AFQLHCVALG VPKPEITWEM PDHSLLSTAS KERTHGSEQL 
HLQGTLVIQN PQTSDSGIYK CTAKNPLGSD YAATYIQVI* 
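
The predicted amino acid sequence follows from the ORF above by reading it three bases at a time. As a minimal sketch, assuming the standard genetic code applies (the figure itself does not state which table was used), the conversion can be checked as follows. The names `CODON_TABLE` and `translate` are illustrative and do not come from the source.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the ORF was translated with the standard table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(orf):
    """Translate a nucleotide ORF into one-letter amino acid codes.

    Whitespace is ignored, so sequences copied in blocks of ten can be
    pasted directly. Codons containing ambiguous bases (for example N)
    are reported as X. Translation stops at the first stop codon, which
    is written as '*', matching the convention used in the figure.
    """
    seq = "".join(orf.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")
        protein.append(residue)
        if residue == "*":
            break
    return "".join(protein)


# First ten codons of the ORF and its final six codons, taken from the listing.
print(translate("ATGAAGGTAAAAGGCAGAGGAATCACCTGC"))
print(translate("TATATTCAAGTAATCTGA"))
```

The two printed fragments can be compared against the first block (MKVKGRGITC) and the final residues (YIQVI*) of the predicted amino acid sequence.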



